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Preface 


Partial differential equations are a many-faceted subject. Created to describe the 
mechanical behavior of objects such as vibrating strings and blowing winds, it 
has developed into a body of material that interacts with many branches of 
mathematics, such as differential geometry, complex analysis, and harmonic 
analysis, as well as a ubiquitous factor in the description and elucidation of 
problems in mathematical physics. 

This work is intended to provide a course of study of some of the major 
aspects of PDE. It is addressed to readers with a background in the basic intro- 
ductory graduate mathematics courses in American universities: elementary real 
and complex analysis, differential geometry, and measure theory. 

Chapter | provides background material on the theory of ordinary differential 
equations (ODE). This includes both very basic material—on topics such as the 
existence and uniqueness of solutions to ODE and explicit solutions to equations 
with constant coefficients and relations to linear algebra—and more sophisticated 
results—on flows generated by vector fields, connections with differential geom- 
etry, the calculus of differential forms, stationary action principles in mechanics, 
and their relation to Hamiltonian systems. We discuss equations of relativistic 
motion as well as equations of classical Newtonian mechanics. There are also 
applications to topological results, such as degree theory, the Brouwer fixed-point 
theorem, and the Jordan-Brouwer separation theorem. In this chapter, we also 
treat scalar first-order PDE, via the Hamilton—Jacobi theory. 

Chapters 2—6 constitute a survey of basic linear PDE. Chapter 2 begins with 
the derivation of some equations of continuum mechanics in a fashion similar to 
the derivation of ODE in mechanics in Chap. 1, via variational principles. We 
obtain equations for vibrating strings and membranes; these equations are not 
necessarily linear, and hence they will also provide sources of problems later, 
when nonlinear PDE is taken up. Further material in Chap. 2 centers around the 
Laplace operator, which on Euclidean space R” is 
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and the linear wave equation, 
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We also consider the Laplace operator on a general Riemannian manifold and the 
wave equation on a general Lorentz manifold. We discuss the basic consequences 
of Green’s formula, including energy conservation and finite propagation speed 
for solutions to linear wave equations. We also discuss Maxwell’s equations for 
electromagnetic fields and their relation with special relativity. Before we can 
establish general results on the solvability of these equations, it is necessary to 
develop some analytical techniques. This is done in the next couple of chapters. 

Chapter 3 is devoted to Fourier analysis and the theory of distributions. These 
topics are crucial for the study of linear PDE. We give a number of basic 
applications to the study of linear PDE with constant coefficients. Among these 
applications are results on harmonic and holomorphic functions in the plane, 
including a short treatment of elementary complex function theory. We derive 
explicit formulas for solutions to Laplace and wave equations on Euclidean 
space, and also the heat equation, 


Ou 
3 = 
(3) r Au = 0. 


We also produce solutions on certain subsets, such as rectangular regions, using 
the method of images. We include material on the discrete Fourier transform, 
germane to the discrete approximation of PDE, and on the fast evaluation of this 
transform, the FFT. Chapter 3 is the first chapter to make extensive use of 
functional analysis. Basic results on this topic are compiled in Appendix A, 
Outline of Functional Analysis. 

Sobolev spaces have proven to be a very effective tool in the existence theory 
of PDE, and in the study of regularity of solutions. In Chap. 4 we introduce 
Sobolev spaces and study some of their basic properties. We restrict attention to 
L?-Sobolev spaces, such as HR"), which consists of L? functions whose 
derivatives of order < k (defined in a distributional sense, in Chap. 3) belong to 
L?(R"), when k is a positive integer. We also replace k by a general real number 
s. The L?-Sobolev spaces, which are very useful for nonlinear PDE, are treated 
later, in Chap. 13. 

Chapter 5 is devoted to the study of the existence and regularity of solutions to 
linear elliptic PDE, on bounded regions. We begin with the Dirichlet problem for 
the Laplace operator, 


(4) Au = fonQ, w= gonad, 
and then treat the Neumann problem and various other boundary problems, 


including some that apply to electromagnetic fields. We also study general 
boundary problems for linear elliptic operators, giving a condition that guarantees 
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regularity and solvability (perhaps given a finite number of linear conditions on 
the data). Also in Chap. 5 are some applications to other areas, such as a proof 
of the Riemann mapping theorem, first for smooth simply connected domains in 
the complex plane C, then, after a treatment of the Dirichlet problem for the 
Laplace operator on domains with rough boundary, for general simply connected 
domains in C. We also develop the Hodge theory and apply it to de Rham 
cohomology, extending the study of topological applications of differential forms 
begun in Chap. 1. 

In Chap. 6 we study linear evolution equations, in which there is a “time” 
variable ft, and initial data are given at t = 0. We discuss the heat and wave 
equations. We also treat Maxwell’s equations, for an electromagnetic field, and 
more general hyperbolic systems. We prove the Cauchy—Kowalewsky theorem, 
in the linear case, establishing local solvability of the Cauchy initial value 
problem for general linear PDE with analytic coefficients, and analytic data, as 
long as the initial surface is “noncharacteristic.” The nonlinear case is treated in 
Chap. 16. Also in Chap. 6 we treat geometrical optics, providing approximations 
to solutions of wave equations whose initial data either are highly oscillatory or 
possess simple singularities, such as a jump across a smooth hypersurface. 

Chapters 1-6, together with Appendix A and Appendix B, Manifolds, Vector 
Bundles, and Lie Groups, make up the first volume of this work. The second 
volume consists of Chaps. 7—12, covering a selection of more advanced topics in 
linear PDE, together with Appendix C, Connections and Curvature. 

Chapter 7 deals with pseudodifferential operators (~DOs). This class of 
operators includes both differential operators and parametrices of elliptic opera- 
tors, that is, inverses modulo smoothing operators. There is a “symbol calculus” 
allowing one to analyze products of DOs, useful for such a parametrix con- 
struction. The L?-boundedness of operators of order zero and the Garding 
inequality for elliptic DOs with positive symbol provide very useful tools in 
linear PDE, which will be used in many subsequent chapters. 

Chapter 8 is devoted to spectral theory, particularly for self-adjoint elliptic 
operators. First we give a proof of the spectral theorem for general self-adjoint 
operators on Hilbert space. Then we discuss conditions under which a differential 
operator yields a self-adjoint operator. We then discuss the asymptotic distribu- 
tion of eigenvalues of the Laplace operator on a bounded domain, making use of 
a construction of a parametrix for the heat equation from Chap. 7. Further 
material in Chap. 8 includes results on the spectral behavior of various specific 
differential operators, such as the Laplace operator on a sphere, and on hyperbolic 
space, the “harmonic oscillator” 


() —A+|z), 


and the operator 
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[2 
which arises in the simplest quantum mechanical model of the hydrogen atom. 
We also consider the Laplace operator on cones. 

In Chap. 9 we study the scattering of waves by a compact obstacle K in R°. 
This scattering theory is to some degree an extension of the spectral theory of the 
Laplace operator on R°\K, with the Dirichlet boundary condition. In addition to 
studying how a given obstacle scatters waves, we consider the inverse problem: 
how to determine an obstacle given data on how it scatters waves. 

Chapter 10 is devoted to the Atiyah—Singer index theorem. This gives a for- 
mula for the index of an elliptic operator D on a compact manifold M, defined by 


(7) Index D = dim ker D — dim ker D*. 


We establish this formula, which is an integral over M of a certain differential 
form defined by a pair of “curvatures,” when D is a first-order differential 
operator of “Dirac type,” a class that contains many important operators arising 
from differential geometry and complex analysis. Special cases of such a formula 
include the Chern—Gauss—Bonnet formula and the Riemann—Roch formula. We 
also discuss the significance of the latter formula in the study of Riemann 
surfaces. 

In Chap. 11 we study Brownian motion, described mathematically by Wiener 
measure on the space of continuous paths in R"”. This provides a probabilistic 
approach to diffusion and it both uses and provides new tools for the analysis 
of the heat equation and variants, such as 


(8) — = —Au+ Vu, 


where V is a real-valued function. There is an integral formula for solutions to (8), 
known as the Feynman—Kac formula; it is an integral over path space with respect 
to the Wiener measure, of a fairly explicit integrand. We also derive an analogous 
integral formula for solutions to 


(9) au = —Au+ Xu, 


where X is a vector field. In this case, another tool is involved in constructing the 
integrand, the stochastic integral. We also study stochastic differential equations 
and applications to more general diffusion equations. 

In Chap. 12 we tackle the 0-Neumann problem, a boundary problem for an 
elliptic operator (essentially the Laplace operator) on a domain Q C C”, which is 
very important in the theory of functions of several complex variables. From a 
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technical point of view, it is of particular interest that this boundary problem does 
not satisfy the regularity criteria investigated in Chap. 5. If Q is “strongly 
pseudo-convex,” one has instead certain “subelliptic estimates,” which are 
established in Chap. 12. 

The third and final volume of this work contains Chaps. 13-18. It is here that 
we study nonlinear PDE. 

We prepare the way in Chap. 13 with a further development of function space 
and operator theory, for use in nonlinear analysis. This includes the theory of 
L?-Sobolev spaces and Holder spaces. We derive estimates in these spaces on 
nonlinear functions F(u), known as “Moser estimates,” which are very useful. We 
extend the theory of pseudodifferential operators to cases where the symbols have 
limited smoothness, and also develop a variant of DO theory, the theory of 
“paradifferential operators,” which has had a significant impact on nonlinear PDE 
since about 1980. We also estimate these operators, acting on the function spaces 
mentioned above. Other topics treated in Chap. 13 include Hardy spaces, com- 
pensated compactness, and “fuzzy functions.” 

Chapter 14 is devoted to nonlinear elliptic PDE, with an emphasis on 
second-order equations. There are three successive degrees of nonlinearity: 
semilinear equations, such as 


(10) Au = F(x,u, Vu), 

quasi-linear equations, such as 

(11) S¢ a! *(a, u, Vu)d;Onu = F(az,u, Vu), 

and completely nonlinear equations, of the form 

(12) G(x, D?u) = 0. 

Differential geometry provides a rich source of such PDE, and Chap. 14 contains 
a number of geometrical applications. For example, to deform conformally a 
metric on a surface so its Gauss curvature changes from k(x) to K(x), one needs to 
solve the semilinear equation 


(13) Au = k(x) — K(x)e™. 


As another example, the graph of a function y = u(x) is a minimal submanifold of 
Euclidean space provided u solves the quasi-linear equation 


(14) (1+ |Vul”)Au+ (Vu) - H(u)(Vu) = 0, 
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called the minimal surface equation. Here, H(u) = (0j0,w) is the Hessian matrix 
of u. On the other hand, this graph has Gauss curvature K(x) provided u solves the 
completely nonlinear equation 


(15) det H(u) = K(a)(14|Vul?)°t??”, 


a Monge—Ampére equation. Equations (13)—(15) are all scalar, and the maximum 
principle plays a useful role in the analysis, together with a number of other tools. 
Chapter 14 also treats nonlinear systems. Important physical examples arise in 
studies of elastic bodies, as well as in other areas, such as the theory of liquid 
crystals. Geometric examples of systems considered in Chap. 14 include equa- 
tions for harmonic maps and equations for isometric embeddings of a 
Riemannian manifold in Euclidean space. 

In Chap. 15, we treat nonlinear parabolic equations. Partly echoing Chap. 14, 
we progress from a treatment of semilinear equations, 


Ou 
(16) ao Lu+ F(x,u, Vu), 
where L is a linear operator, such as L = A, to a treatment of quasi-linear 
equations, such as 


a ss 
(17) Fr Det, u)Oeut X(u). 


(We do very little with completely nonlinear equations in this chapter.) We study 
systems as well as scalar equations. The first application of (16) we consider is to 
the parabolic equation method of constructing harmonic maps. We also consider 
“reaction—diffusion” equations, ¢ x & systems of the form (16), in which 
F(a,u, Vu) = X(u), where X is a vector field on R‘, and L is a diagonal 
operator, with diagonal elements a;A, a; >0. These equations arise in mathe- 
matical models in biology and in chemistry. For example, u = (u1,---, ue) might 
represent the population densities of each of @ species of living creatures, dis- 
tributed over an area of land, interacting in a manner described by X and diffusing 
in a manner described by a,A. If there is a nonlinear (density-dependent) diffu- 
sion, one might have a system of the form (17). 

Another problem considered in Chap. 15 models the melting of ice; one has a 
linear heat equation in a region (filled with water) whose boundary (where the 
water touches the ice) is moving (as the ice melts). The nonlinearity in the 
problem involves the description of the boundary. We confine our analysis to a 
relatively simple one-dimensional case. 

Nonlinear hyperbolic equations are studied in Chap. 16. Here continuum 
mechanics is the major source of examples, and most of them are systems, rather 
than scalar equations. We establish local existence for solutions to first-order 
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hyperbolic systems, which are either “symmetric” or “symmetrizable.” 
An example of the latter class is the following system describing compressible 
fluid flow: 


Ov 1 Op 
18 oP + Vv + —gradp=0, 
(18) At +V,u+ 5a D AE 


+V p+ p divu =0, 

for a fluid with velocity v, density p, and pressure p, assumed to satisfy a relation 
p = p(p), called an “equation of state.” Solutions to such nonlinear systems tend 
to break down, due to shock formation. We devote a bit of attention to the study 
of weak solutions to nonlinear hyperbolic systems, with shocks. 

We also study second-order hyperbolic systems, such as systems for a 
k-dimensional membrane vibrating in R”, derived in Chap. 2. Another topic 
covered in Chap. 16 is the Cauchy—Kowalewsky theorem, in the nonlinear case. 
We use a method introduced by P. Garabedian to transform the Cauchy problem 
for an analytic equation into a symmetric hyperbolic system. 

In Chap. 17 we study incompressible fluid flow. This is governed by the Euler 
equation 


a 
(19) Or +V,v=~—gradp, divv =0, 


in the absence of viscosity, and by the Navier-Stokes equation 


(20) a +V,v=vLv—gradp, divv=0, 
in the presence of viscosity. Here £ is a second-order operator, the Laplace 
operator for a flow on flat space; the “viscosity” v is a positive quantity. Equation 
(19) shares some features with quasi-linear hyperbolic systems, though there are 
also significant differences. Similarly, (20) has a lot in common with semilinear 
parabolic systems. 

Chapter 18, the last chapter of this work, is devoted to Einstein’s gravitational 
equations: 


(21) Giz = 82K Tix. 


Here G;;; is the Einstein tensor, given by Gj; = Ricj, — (1/2) gjx, where Ric;, 
is the Ricci tensor and S the scalar curvature, of a Lorentz manifold (or 
“spacetime”) with metric tensor g;;. On the right side of (21), T)j;, is the stress— 
energy tensor of the matter in the spacetime, and k is a positive constant, which 
can be identified with the gravitational constant of the Newtonian theory of 
gravity. In local coordinates, G';;, has a nonlinear expression in terms of gj; and 
its second-order derivatives. In the empty-space case, where Tj; = 0, (21) is a 
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quasi-linear second-order system for g;;. The freedom to change coordinates 
provides an obstruction to this equation being hyperbolic, but one can impose the 
use of “harmonic” coordinates as a constraint and transform (21) into a hyper- 
bolic system. In the presence of matter one couples (21) to other systems, 
obtaining more elaborate PDE. We treat this in two cases, in the presence of an 
electromagnetic field, and in the presence of a relativistic fluid. 

In addition to the 18 chapters just described, there are three appendices, 
already mentioned above. Appendix A gives definitions and basic properties of 
Banach and Hilbert spaces (of which L’-spaces and Sobolev spaces are exam- 
ples), Fréchet spaces (such as C®(IR”)), and other locally convex spaces (such as 
spaces of distributions). It discusses some basic facts about bounded linear 
operators, including some special properties of compact operators, and also 
considers certain classes of unbounded linear operators. This functional analytic 
material plays a major role in the development of PDE from Chap. 3 onward. 

Appendix B gives definitions and basic properties of manifolds and vector 
bundles. It also discusses some elementary properties of Lie groups, including a 
little representation theory, useful in Chap. 8, on spectral theory, as well as in the 
Chern—Weil construction. 

Appendix C, Connections and Curvature, contains material of a differential 
geometric nature, crucial for understanding many things done in Chaps. 10-18. 
We consider connections on general vector bundles, and their curvature. We 
discuss in detail the special properties of the primary case: the Levi—Civita 
connection and Riemann curvature tensor on a Riemannian manifold. We discuss 
the basic properties of the geometry of submanifolds, relating the second fun- 
damental form to curvature via the Gauss—Codazzi equations. We describe how 
vector bundles arise from principal bundles, which themselves carry various 
connections and curvature forms. We then discuss the Chern—Weil construction, 
yielding certain closed differential forms associated to curvatures of connections 
on principal bundles. We give several proofs of the classical Gauss—Bonnet 
theorem and some related results on two-dimensional surfaces, which are useful 
particularly in Chaps. 10 and 14. We also give a geometrical proof of the Chern— 
Gauss—Bonnet theorem, which can be contrasted with the proof in Chap. 10, as a 
consequence of the Atiyah—Singer index theorem. 

We mention that, in addition to these “global” appendices, there are appendices 
to some chapters. For example, Chap. 3 has an appendix on the gamma function. 
Chapter 6 has two appendices; Appendix A has some results on Banach spaces of 
harmonic functions useful for the proof of the linear Cauchy—Kowalewsky theo- 
rem, and Appendix B deals with the stationary phase formula, useful for the study 
of geometrical optics in Chap. 6 and also for results later, in Chap. 9. There are 
other chapters with such “local” appendices. Furthermore, there are two sections, 
both in Chap. 14, with appendices. Section 6, on minimal surfaces, has a com- 
panion, Sect. 6B, on the second variation of area and consequences, and Sect. 12, 
on nonlinear elliptic systems, has a companion, Sect. 12B, with complementary 
material. 
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Having described the scope of this work, we find it necessary to mention a 
number of topics in PDE that are not covered here or are touched on only very 
briefly. 

For example, we devote little attention to the real analytic theory of PDE. We 
note that harmonic functions on domains in R” are real analytic, but we do not 
discuss the analyticity of solutions to more general elliptic equations. We do 
prove the Cauchy—Kowalewsky theorem, on analytic PDE with analytic Cauchy 
data. We derive some simple results on unique continuation from these few 
analyticity results, but there is a large body of lore on unique continuation, for 
solutions to nonanalytic PDE, neglected here. 

There is little material on numerical methods. There are a few references to 
applications of the FFT and of “splitting methods.” Difference schemes for PDE 
are mentioned just once, in a set of exercises on scalar conservation laws. Finite 
element methods are neglected, as are many other numerical techniques. 

There is a large body of work on free boundary problems, but the only one 
considered here is a simple one-space dimensional problem, in Chap. 15. 

While we have considered a variety of equations arising from classical physics 
and from relativity, we have devoted relatively little attention to quantum 
mechanics. We have considered a few quantum systems in Chap. 8, including 
models of the hydrogen atom and the deuteron. Also, there are some exercises on 
potential scattering mentioned in Chap. 9. However, the physical theories behind 
these equations are not discussed here. 

There are a number of nonlinear evolution equations, such as the Korteweg— 
deVries equation, that have been perceived to provide infinite dimensional ana- 
logues of completely integrable Hamiltonian systems, and to arise “universally” 
in asymptotic analyses of solutions to various nonlinear wave equations. They are 
not here. Nor is there a treatment of the Yang—Mills equations for gauge fields, 
with their wonderful applications to the geometry and topology of 
four-dimensional manifolds. 

Of course, this is not a complete list of omitted material. One can go on and on 
listing important topics in this vast subject. The author can at best hope that the 
reader will find it easier to understand many of these topics with this book, than 
without it. 
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Introduction to the Second Edition 


In addition to making numerous small corrections to this work, collected over the 
past dozen years, I have taken the opportunity to make some very significant 
changes, some of which broaden the scope of the work, some of which clarify 
previous presentations, and a few of which correct errors that have come to my 
attention. 

There are seven additional sections in this edition, two in Volume 1, two in 
Volume 2, and three in Volume 3. Chapter 4 has a new section, “Sobolev spaces 
on rough domains,” which serves to clarify the treatment of the Dirichlet problem 
on rough domains in Chap. 5. Chapter 6 has a new section, “Boundary layer 
phenomena for the heat equation,” which will prove useful in one of the new 
sections in Chap. 17. Chapter 7 has a new section, “Operators of harmonic 
oscillator type,” and Chap. 10 has a section that presents an index formula for 
elliptic systems of operators of harmonic oscillator type. Chapter 13 has a new 
appendix, “Variations on complex interpolation,” which has material that is 
useful in the study of Zygmund spaces. Finally, Chap. 17 has two new sections, 
“Vanishing viscosity limits” and “From velocity convergence to flow 
convergence.” 

In addition, several other sections have been substantially rewritten, and 
numerous others polished to reflect insights gained through the use of these books 
over time. 


Introduction to the Third Edition 


I have provided further polishings and supplements for this third edition. New 
material in Volume | includes a section on rigid body motion in Chapter 1, which 
will tie in to the derivation of the Euler equation of incompressible fluid flow in 
Chapter 17. Chapter 3 has a new appendix on the central limit theorem, related to 
a random walk, which will tie in to the treatment of Brownian motion in 
Chapter 11. In addition there is an expanded treatment of the Poisson integral in 
Chapter 5, a section on the Schrédinger equation in Chapter 6, and an expanded 
treatment of holomorphic functional calculus in Appendix A. 
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New material in Volume 2 includes sections on a quantum model of the 
deuteron, a quantum adiabatic theorem, and a quantum ergodic theorem, and 
appendices on the classical ergodic theorem and on shifted wave equations in 
Chapter 8, as well as expanded treatments of the spectral theorem and of analysis 
on hyperbolic space in that chapter. In Chapter 11 I have added a section on 
diffusion on Riemannian manifolds, with application to models of relativistic 
diffusion. 

New material in Volume 3 includes a section on overdetermined elliptic 
systems in Chapter 14 and a section on Euler flows on rotating surfaces, influ- 
enced by the Coriolis force, in Chapter 17. 


Chapel Hill, USA Michael E. Taylor 
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Function Space and Operator Theory 
for Nonlinear Analysis 


Introduction 


This chapter examines a number of analytical techniques, which will be applied 
to diverse nonlinear problems in the remaining chapters. For example, we 
study Sobolev spaces based on L?, rather than just L?. Sections 1 and 2 discuss 
the definition of Sobolev spaces H k.P for k € Zt, and inclusions of the form 
H*? Cc LIL‘. Estimates based on such inclusions have refined forms, due to 
E. Gagliardo and L. Nirenberg. We discuss these in § 3, together with results of 
J. Moser on estimates on nonlinear functions of an element of a Sobolev space, 
and on commutators of differential operators and multiplication operators. 
In §4 we establish some integral estimates of N. Trudinger, on functions in 
Sobolev spaces for which L°-bounds just fail. In these sections we use such 
basic tools as Hélder’s inequality and integration by parts. 

Applying the Fourier transform to analysis on L? is a more subtle affair when 
p # 2. One result that does often serve when, in the L?-theory, one could appeal to 
the Plancherel theorem, is Mikhlin’s Fourier multiplier theorem, presented in § 5. 
This is followed by a development of Calderon-Zygmund theory and Littlewood- 
Paley theory. This theory enables interpolation theory to be applied to the study 
of the spaces H*’”, for noninteger s, in § 6. In § 7 we apply some of this material 
to the study of L?-spectral theory of the Laplace operator, on compact manifolds, 
possibly with boundary. 

In § 8 we study spaces C” of Hélder continuous functions, and their relation 
with Zygmund spaces C7. We derive estimates in these spaces for solutions to 
elliptic boundary problems. 

The next two sections extend results on pseudodifferential operators, intro- 
duced in Chap.7. Section 9 considers symbols p(x, €) with minimal regularity 
in x. We derive both L?- and Hdélder estimates. Section 10 considers paradiffer- 
ential operators, a variant of pseudodifferential operator calculus particularly well 
suited to nonlinear analysis. Sections 9 and 10 are largely taken from [T2]. 

In § 11 we consider “fuzzy functions,” consisting of a pair (f,), where f is 
a function on a space 22 and X is a measure on 22 x R, with the property that 
Jf ye(2) dX(x,y) = [ v(x) f(x) dx. The measure \ is known as a Young mea- 
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sure. It incorporates information on how f may have arisen as a weak limit of 
smooth (“sharply defined”) functions, and it is useful for analyses of nonlinear 
maps that do not generally preserve weak convergence. 

In § 12 there is a brief discussion of Hardy spaces, subspaces of L1(IR”) with 
many desirable properties, only a few of which are discussed here. Much more on 
this topic can be found in [S3], but material covered here will be useful for some 
elliptic regularity results in § 12B of Chap. 14. 

We end this chapter with Appendix A, discussing variants of the complex in- 
terpolation method introduced in Chap. 4 and used a lot in the early sections of 
this chapter. It turns out that slightly different complex interpolation functors are 
better suited to the scale of Zygmund spaces. 


1. L?-Sobolev spaces 


Let p € [1, co). In analogy with the definition of the Sobolev spaces in Chap. 4, 
we set, fork = 0,1,2,..., 


(1) H*P(R") = {ue LP(R") : D®u € L?(R") for |a| < k}. 


It is easy to see that S(IR”) is dense in each space H*-?(R”), with its natural norm 


(1.2) lull zee = > ]D°ullze. 


lal<k 


For p # 2, we cannot characterize the spaces H*’?(IR) conveniently in terms of 
the Fourier transform. It is still possible to define spaces H*?(IR”) by interpola- 
tion; we will examine this in § 6. Here we will consider only the spaces H*?(R") 
with k a nonnegative integer. 

The chain rule allows us to say that if x : R" — R” is a diffeomorphism that 
is linear outside a compact set, then y* : H*?(IR"”) + H*?(IR"). Also multipli- 
cation by an element y € C5°(R”) maps H*-?(R”) to itself. This allows us to 
define H*:?(M) for a compact manifold M via a partition of unity subordinate to 
a coordinate chart. Also, for compact M, if we define Diff” (M) to be the set of 
differential operators of order < k on M, with smooth coefficients, then 


(1.3)  H*?(M) = {ue L?(M) : Pu € L?(M) forall P € Diff*(M)}. 


We can define H*?(R") as in (1.1), with R” replaced by R". The exten- 
sion operator defined by (4.2)-(4.4) of Chap. 4 also works to produce extension 
maps E : H™?(R") — H*?(R"). Similarly, if M is a compact manifold 
with smooth boundary, with double NV, we can define H**?(M) via coordinate 
charts and the notion of H*:? (R'L), or by (1.3), and we have extension operators 
E: H®?(M) > H*®?(N). 


Exercises 3 


We also note the obvious fact that 
(1.4) D® : H®?(R") —> H*-leLP(R), 
for |a| < k, and 
(1.5) P: H®?(M) — H*-*?(M) if P € Diff’(M), 


provided @ < k. 


Exercises 


1. A Friedrichs mollifier on R” is a family of smoothing operators J-u(x%) = je * u(x) 
where 


je(z) =e "j(e*2), [i@ae =1, j€S(R”). 
Equivalently, Jeu(x) = p(eD)u(x), » € S(R”), y(O) = 1. Show that, for each 
€ [l,0o0), kEZ*, 
Je: H*P(R") — (1) H°?(R"), 
L<0o 
for each € > 0, and 
Jeu—u_ in H*?(R") 


ase > Oifuc H*?(R"). 
2. Suppose A € C1(R”), with ||Al|oi = SUP}q|<1 || D° Allis. Show that when Jz is a 
Friedrichs mollifier as above, then 


I|[A, Je]ollat.e < CllAllcs|lul|ze, 


with C independent of « € (0,1). (Hint: Write A(x) — A(y) = >> Be(z,y) 
(te — Yr), |Be(x, y)| < K, and, with ge(a) = 07 /Oxe, 


ard, Jelo(e) => f Belew) [eae (24) =") oy) dy, 
with absolute value bounded by 
Ke" Sf |oue(e'(@— ))| -fo(o)lay, 


where yre(x) = reqe(x).) 
3. Using Exercise 2, show that 


II[A, Je]Oju|]z2 < CllAllcs|lullze. 
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2. Sobolev imbedding theorems 
We will derive various inclusions of the type H’?(M) Cc H®%(M). We will 
concentrate on the case MM = R”. The discussion of § 1 will give associated 
results when JV is a compact manifold, possibly with (smooth) boundary. 

One technical tool useful for our estimates is the following generalized Holder 


inequality: 


Lemma 2.1. Ifp; € [1,00], 0p; * = 1, then 


(2.1) 1 hipaioae hae S [fel ceoaay > thdllceetan: 
M 


The proof follows by induction from the case m = 2, which is the usual Holder 
inequality. 
Our first Sobolev imbedding theorem is the following: 
Proposition 2.2. For p € [1,n), 
(2.2) HYP (R") c LP/-?) (RR). 
In fact, there is an estimate 
(2.3) lull pnesin—v) < Cl|Vullze, 
foru € H'?(R”), with C = C(p,n). 


Proof. It suffices to establish (2.3) for u € Cf°(R”). Clearly, 


so 
es 1/(n—-1) 
(2.5) u(x) < ¢ YT / |Dju| dex; 
re es 
We can integrate (2.5) successively over each variable x;, 7 = 1,...,n, and 
apply the generalized Hélder inequality (2.1) with m = py =-:-=pm=n-—1 
after each integration. We get 
1/n 
(2.6) lull ens» < 4] / |Djul dx < C||Vullz.. 


J=lpn 
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This establishes (2.3) in the case p = 1. We can apply this to v = |ul7, y > 1, 
obtaining 


Cn | <Olflul™ "Valls < Clllel™™ Tp 


Le/(n-1) = 


For p < n, pick y = (n— 1)p/(n — p). Then (2.7) gives (2.3) and the proposition 
is proved. 


Given u € H*-?(IR”), we can apply Proposition 2.2 to estimate the prelin=P). 
norm of D*~'y in terms of ||D*1u||», where we use the notation 


(2.8) D'u ={D°u: |a| =k}, ||D*ullep = S> DP ullze, 
ja|=k 


and proceed inductively, obtaining the following corollary. 
Proposition 2.3. For kp <n, 
(2.9) HEP (R”) c pnP/(n—kp)(R”), 


The same result holds with R” replaced by a compact manifold of dimension n. 
If we take p = 2, then for the Sobolev spaces H*(IR”) = H*-?(IR”), we have 


(2.10) HER") Cc L27/(m-2k)(R) kh < . 
Consequently, the interpolation theory developed in Chap. 4 implies 
(2.11) PPR) C179), 


for any real s € [0,k], k < n/2 an integer. Actually, (2.11) holds for any real 
s € [0,n/2), as will be shown in § 6. We write down some particular examples, 
for n = 2,3,4, which will play a role later in various nonlinear evolution equa- 
tions, such as the Navier-Stokes equations. The cases n = 3,4 follow from the 
results proved above, while the case n = 2 follows from the general case of (2.11) 
established in § 6. 


HeychR) AR) c rR) 
(2.12) H°/4(R3) c L4(R?) 
H/?(R?) Cc L4(R?) H}/?(R3) Cc L3(R3) 


Note that interpolation of the R?-result with L?(IR?) = L?(IR?) yields 


H/3(R?) c L3(R?). 
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The next result provides a partial generalization of the Sobolev imbedding 
theorem, 


H*(R") CO(R"), 8>5, 
proved in Chap. 4. A more complete generalization is given in § 6. 
Proposition 2.4. We have 
(2.13) H®P?(R") C C(R")N L®(R"), forkp>n. 
Proof. It suffices to obtain a bound on ||u|| p-0(g») for u € H*?(R"), if kp > n. 
In turn, it suffices to bound u(0) appropriately, for u € C§°(IR”). Use polar coor- 


dinates, x = rw, w € S$", Let g € C™(R) have the property that g(r) = 1 for 
r <1/2 and g(r) = 0 for r > 3/4. Then, for each w, we have 


(0) = = fF tatrdulrw)] ar 


7 oe oe (2) ptritrso} rl dp, 


upon integrating by parts k — 1 times. Integrating over w € S”—! gives 


(2) truce 


where B is the unit ball centered at 0. Hélder’s inequality gives 


dx 


’ 


1u(0)| < ¢ fr 
B 


(2.14) [u(0)| < Cllr?" ee wy llr[g(r)u(@)] || po asy> 


with 1/p + 1/p’ = 1. We claim that (0/0r)* is a linear combination of 
D*, \a| =k, with L°-coefficients. To see this, note that 0” annihilates x for 
la| < k, so we get 


F) k 
(2.15) (5) = Gano" 


SO Gq (x) is homogeneous of degree 0 in a and smooth on R” \ 0. 


Exercises 7 


Returning to the estimate of (2.14), our information on (0/0r)* implies that 
the last factor on the right side is bounded by the H*:?-norm of u. The factor 
| Esau |e (B) is finite provided kip > n, so the proposition is proved. 


To close this section, we note the following simple consequence of 
Proposition 2.2, of occasional use in analysis. Let MM(IR") denote the space 
of locally finite Borel measures (not necessarily positive) on R”. Let us assume 
that n > 2. 


Proposition 2.5. [f we have u € M(R”) and Vu € M(R"), then it follows that 
w € LRN pr), 

Proof. Using a cut-off in C§°, we can assume u has compact support. Applying 
a mollifier, we get u; = x; * u € C§°(R”) such that u; > u and Vu; > Vu 
in M(R"). In particular, we have a uniform L1-norm estimate on Vu,;. By (2.3) 


we have a uniform L”/("—)-norm estimate on u;, which gives the result, since 
L”/(—1) (R") is reflexive. 


Exercises 


1. If p; € [1,00] and u; € Ls, show that uiu2 € L” provided rot =p, t+ pe7t 


[0, 1]. Show that this implies Lemma 2.1. 
2. Use the containment (which follows from Proposition 2.2) 


€ 


HY? (R") C HYnP/ (DP) (R") if (k—1)p<n 


to show that if Proposition 2.4 is proved in the case k = 1, then it follows in general. 
Note that the proof in the text of Proposition 2.4 is slightly simpler in the case k = 1 
than for k > 2. 

3. Suppose k = 22 is even. Suppose u € S’(R”) and 


(-A+1)'u = f € L?(R”). 


Show that m 
u=TIexf, Tel) = (6). 
Using estimates on 7%, (a) established in Chap. 3, § 8, show that 


kp >n => u€ C(R")NL™(R"). 


Show that this gives an alternative proof of Proposition 2.4 in case k is even. 
4. Suppose k = 2¢+ 1 is odd, kp > 1. Use the containment 


H®P(R") Cc HRTEP/ PR") fp <n, 


which follows from Proposition 2.2, to deduce from Exercise 3 that Proposition 2.4 
holds for all integers k > 2. 
5. Establish the following variant of the & = 1 case of (2.14): 


(2.16) |u(O) — u(x)| < Cl|Vullrowsy), p>n, x € OB. 
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(Hint: Suppose x = e1. If yz is the line segment from 0 to z, followed by the line 
segment from z to e1, write 


u(ex) - u(0) = f (f au) dS(2), p= {seBim=5h. 


Show that this gives u(e1) — u(0) = J, Vu(z) - y(z) dz, with yp € L*(B), Vq < 
n/(n —1).) 


. Show that H"1(R”) C C(R") NM L®(R"). 


(Hint: u(x) = Fi tee f. Dy---Dnu(a + y) dy +--+ dyn.) 


3. Gagliardo—Nirenberg—Moser estimates 


In this section we establish further estimates on various L?-norms of derivatives 
of functions, which are very useful in nonlinear PDE. Estimates of this sort arose 
in work of Gagliardo [Gag], Nirenberg [Ni], and Moser [Mos]. Our first such 
estimate is the following. We keep the convention (2.8). 


Proposition 3.1. For realk > 1, 1<p<k, we have 


(3.1) || Dull Z2x/y < Cllullp2xs@—y - || DF ull p2x/@+n 5 


(3.2) N 


for all u € O§°(R"), hence for all u € L®(R") ON H*:™, where 


2k 2k 


ee 


Proof. Given v € CS°(R”), q > 2, we have v|v|4-? € Cd (R”) and 


D;(v|v|?-*) = (q — 1)(D;v)|o|*-?. 


Letting v = D;u, we have 


|Djul? = Dj(u Dju|Djul**) — (q— 1)u Deu|Djult?. 


Integrating this, we have, by the generalized Holder inequality (2.1), 


a 
(3.3) |Djullte < la— 1 - lull ||DFullca |Djullf”, 


where ¢ = 2k/p and q; and q are given by (3.2). Dividing by ||D,ul|%;7 gives 
the estimate (3.1) for u € C§°(R”), and the proposition follows. 


If we apply (3.1) to D’~1u, we get 


(3.4) || D*u|[F ons < Cll De ul] pae/@—» || D2 ull p2ereo+n, 


3. Gagliardo—Nirenberg—Moser estimates 9 
for realk > 1, p € [1,k], € > 1. Consequently, for any < > 0, 
(3.5) || Dull pansp < Cel|DP | panyee—y + C(e)||D2t ull paxsco42)- 


If p € [2,k] and @ > 2, we can apply (3.5) with p replaced by p — 1 and D*~!u 
replaced by D‘~?u, to get, for any €, > 0, 


(3.6) || De ul pen;@— < Ce,||De-2 ull p2%/te-2) af C(é1)||D*ull paxsp- 


Now we can plug (3.6) into (3.5); fix €; (e.g., €1 = 1), and pick € so small that 
CeC(e1) < 1/2, so the term CeC(e1)||D*ull p2x/» can be absorbed on the left, 
to yield 

(3.7) || D’ull p2e72 < Ce||D* 7 ull p2ese—2) + C(e)|| Dot ul] paescw4, 


for real k > 2, p € [2,k], € > 2. Continuing in this fashion, we get 
(3.8) || Dull p2%/2 < Cel| Do Full paxs@-» + C(e)|| Dot ull pexs@4n, 


j <p<k, € > j. Similarly working on the last term in (3.8), we have the 
following: 


Proposition 3.2. If7 <p<k+1l—m, @> J, then (for sufficiently small ¢ > 0) 
(3.9) ||D°ul|paez> < Ce||D° Full paxs@-» + C(E)|D Ful] p2x/@+m). 


Here, 7, 2, and m must be positive integers, but p and k are real. Of course, the 
full content of (3.9) is represented by the case 2 = 7, which reads 


(3.10) || D* || p2%/p < Ce||ul| paxs@—0 + C(e)||D°t™ ull p2nsco+my , 


for < p< k+1-—~m. Taking p+ m = k, we note the following important 
special case. 


Corollary 3.3. If 0, p, and k are positive integers satisfying € < p< k —1, then 


(3.11) || D’ul| pax/e2 < Cellul| pax/@—e + C(e)||D*t* Pull ze. 


In particular, taking p = @, if 2 < k, then 
(3.12) || Dé ull p2nse < Cellul| p-- + C(e)||D®ullze, 


for all u € Cpe (R”). 

We want estimates for the left sides of (3.11) and (3.12) which involve prod- 
ucts, as in (3.1), rather than sums. The following simple general result produces 
such estimates. 
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Proposition 3.4. Let ¢, uu, and m be nonnegative integers satisfying € < max 
(u,m), and let q,r, and p belong to (1, co]. Suppose the estimate 


(3.13) || D°ullze < Cy||D“ullzr + C2||D™ ull zo 


is valid for all u € C§°(R"). Then 


(3.14) ||D’ullne < (Cy + Co) DM ule «Imus er”), 

with 

G.15) a= pt B=-2 42 mse, 
oe q Pp 


provided these quantities are not both zero. If (3.13) is valid and the quantities 
(3.15) are both nonzero, then they have the same sign. 


Proof. Replacing u(x) in (3.13) by u(sa) produces from (3.13), which we write 
schematically as Q < C, R+ CoP, the estimate 


sf "/IQ < Cyst" R+Cos™—/"P, forall s > 0, 


or equivalently, 
Q < Cis*R+Cs~8P, forall s > 0, 


with a and ( given by (3.15). If a and ( have opposite signs, one can take s —+ 0 
or s — oo to produce the absurd conclusion Q = 0. If they have the same sign, 
one can take s so that s*R = s~®P = P*R®, which can be done with a = 
a/(a+ 6), b= 6/(a + (), and the estimate (3.14) results. 


Applying Proposition 3.4 to the estimate (3.11), we find a = (n — 
2k)t/2k, B = (n — 2k)(k — p)/2k, which gives the following: 


Proposition 3.5. If ¢, p, and k are positive integers satisfying € < p< k—1, then 


ba kte— 1p L/(k+l— 
(3.16) Deel zane < Chul PVG - Deter ee, 


In particular, taking p = ¢, if 2 < k, then 


(3.17) || D°ull arse < Cllull ao! « Deal. 


One of the principal applications of such an inequality as (3.17) is to bilinear 
estimates, such as the following. 


3. Gagliardo—Nirenberg—Moser estimates 


Proposition 3.6. If || + |y| = k, then 


(3.18) I|(D? f)(D7g)llnz < Cllfllc~llgllax + Cllfllarligliz-, 
for all f,g € Co(R") MN H*(R"). 


Proof. With |G] = @,|y| = m, and 2+ m = k, we have 


(3.19) I|(D°f)(D g)llzz < ||!D° fll zexre - |D7gll z2x7m 
Le 


Hk Lx 


using Hoélder’s inequality and (3.17). We can write the right side of (3.19) as 


(3.20) C(\lfllo~ lalla)” * (WA llazellall z=), 


and this is readily dominated by the right side of (3.18). 


The two estimates of the next proposition are major implications of (3.18). 


Proposition 3.7. We have the estimates 


(3.21) If + glla* < Cll fllz=llglla« + Cll flare llgilze 


and, for |a| < k, 


(3.22) |[D°(f-g) — fD°gllz2 < Cllfllaeligiiz~ + CVF llz~ llgllae-. 


1-—2/k L/k 1l—m/k m/k 
<Olflre Wee lal ee 


Proof. The estimate (3.21) is an immediate consequence of (3.18). To prove 


(3.22), write 

(3.23) D*(f-g= Do (3) (D*f) (D9), 
B+y=a 

so, if |a| = k, 


Dy-9-fD%9=  Y (3) (D*N(D) 


(3.24) B+ y=a,h>0 


= S ClDP DPD 9). 
6l+ly=k-1 


Hence, with u; = D;f, 


(3.25) |D°(f9) — fD°glaz<C YS > |\(DPuj)(D7g)|Iz2. 


[Bl+ly|=k-1 
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From here, the estimate (3.22) follows immediately from (3.18), and Proposition 
3.7 is proved. Note that on the right side of (3.22), we can replace || f||z7« by 


I|V fllare-1- 


From Proposition 3.4 there follow further estimates involving products of 
norms, which can be quite useful. We record a few here. 


Proposition 3.8. We have the estimates 


(3.26) lulz < C|D™* ull. |D™ Tul, for u © CX (R?”), 
and 
(3.27) |jullz~ <O|D™ ull ||D™ ull”, foru< Coe(Re"?), 


Proof. It is easy to see that 

(3.28) ||21||2.0 < C||D™t*ullz2 +C||D™*ullZ2, for u € CS°(R?”), 
and 

(3.29) l[2||7..0 < C\|D™tul[?2 + C|_D™ullZ2, for u € Co°(R?"T"). 


Proposition 3.4 then yields a = 3 = 1 in case (3.28) and a = @ = 1/2 in case 
(3.29), proving (3.26) and (3.27). 


A more delicate L°°-estimate will be proved in § 8. 
It is also useful to have the following estimates on compositions. 


Proposition 3.9. Let F be smooth, and assume F(0)=0. Then, for we H* 
MER, 


(3.30) |F(u) lle < Ce ([lullace) (1 + llullare). 


Proof. The chain rule gives 


D°F(u)= > Cou)... uGo) PH (uy), 
Bit--By=o 
hence 
(3.31) |D¥F(u)l|n2 < Cp([lellz~) Yo [fu --- uJ] 


From here, (3.30) is obtained via the following simple generalization of 
Proposition 3.6: 
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Lemma 3.10. /f|(3,| + +--+ |G,| =k, then 


(3.32) (AP) = 4 [Ln2 < CDI allew +-- Poll zoe + UM full c= LF. 


Proof. The generalized Hélder inequality dominates the left side of (3.32) by 


(3.33) A? reerient = FE IL p2erieut- 


Then applying (3.17) dominates this by 


1—|A1|/k k 1-|6,|/k l/h 
(3.34) Chall Ae fall pale, 
which in turn is easily bounded by the right side of (3.32) (with f = (fi,..-, fy). 


We remark that Proposition 3.9 also works if u takes values in R”. The esti- 
mates in Propositions 3.7 and 3.8 are called Moser estimates, and are very useful 
in nonlinear PDE. Some extensions will be given in (10.20) and (10.52). 


Exercises 


1. Show that the proof of Proposition 3.1 yields 
(3.35) ||Djullia < Cllul|za: - |D°ul| cas 


whenever 2 < g < co, 1 < qj; < oo, and 1/q + 1/q2 = 2/gq. Show that if qo < 
q < qi, then (3.35) and (3.1) are equivalent. Is (3.35) valid if the hypothesis q > 2 is 
relaxed to g > 1? 

2. Show directly that (3.35) holds with qi = q2 = q € [1, oo]. (Hint: Do the next exer- 
cise.) 

3. Let A generate a contraction semigroup on a Banach space B. Show that 


(3.36) || Awl]? < 8||ul] - || A?ul], for w € D(A”). 


(Hint: Use the identity —tAu = t(t — A)~'A®u 4+ t?u — #7t(t — A)~*u together 
with the estimate ||¢(¢ — A)~*|| < 1, for t > 0, to obtain the estimate ¢||Aul| < 
|| A?u|| + 2¢?||ul|, for £ > 0.) Try to improve the 8 to a 4 in (3.36), in case B is a 
Hilbert space. 

4. Show that (3.10) implies 


(3.37) || Déullze < Cy|lullz- + C2||D*t™ ull ze 


when p < q < rare related by 


1 mi, é 1 


(3.38) a ee 
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as long as we require furthermore that q > 2, in order to satisfy the hypothesis p/k < 
1—(m—1)/k used for (3.10). In how much greater generality can you establish (3.37)? 
Note that if Proposition 3.4 is applied to (3.37), one gets 

(3.39) |D%ullca < Cll" Dale, 


provided (3.38) holds. 

5. Generalize Propositions 3.6 and 3.7, replacing L? and H® by L? and H™?. Use (3.10) 
to do this for p > 2. Can you also treat the case 1 < p < 2? 

6. Show that in (3.30) you can use C;,(||u||z-°) with 


(3.40) Cy(A)= sup |F“)(x)|. 


|2|<A,p<k 


7. Extend the Moser estimates in Propositions 3.7 and 3.9 to estimates in H**?-norms. 


4. Trudinger’s inequalities 

The space H”/?(R”) does not quite belong to L~(R”), although H”/?(R") C 
L?(R”) for all p € [2, 00). In fact, quite a bit more is true; exponential functions 
ofu € H"/? (IR”) are locally integrable. The proof of this starts with the following 
estimate of ||u|| ,> (Rn) a8 p — oo. 

Proposition 4.1. [fu € H"/?(R"), then, for p € [2, 00), 

(4.1) ellen) S Cup? lle srms2 en): 

Proof. We have u = A~"/?v for v € L?(IR"), where, recall, 

(4.2) (A~*v) *(€) = (€)°0(6). 


Hence, with v € L?(R"), 


(4.3) Uu= TIn/2 *U, 
where 
(4.4) i= 


The behavior of 7;,/2(x) follows results of Chap. 3. By Proposition 8.2 of Chap. 3, 
Inj2(x) is C°° on R” \ 0 and vanishes rapidly as |a| — 00. By Proposition 9.2 
of Chap. 3, we have 


(4.5) Injolx) < Cla|-™?, for |x| <1. 
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Consequently, 7,,/2 just misses being in L?(IR"); we have, for 6 € (0, 1], 
: C. 

(4.6) \|Fnjall tascam) SO + c| pele? dpe ae 
Now the map K, defined by Kk, f = v « f, with v given in L?(R"), satisfies 
(4.7) il AL Boxter, 
both maps having operator norm ||v|| ;2. By interpolation, 
(4.8) Kv f llzecen) S Wf lla) - [lullz2qr»y, for g € [1, 2], 


where p is defined by 1/q — 1/p = 1/2. Taking f = Jnj2, q = 2 — 6, we have, 
for v € L?(R"), 


Cy, \ 1/(2-8) 2(2 — 0) 

P < 2 = 

49) Wnjaeollin < (2) Itollee, p=, 
which gives (4.1). 


The following result, known as Trudinger’s inequality, is a direct consequence 
of (4.1): 


Proposition 4.2. [fu € H”/?(R”), there is a constant y = y(u) > 0, of the form 


Yn 
(4.10) y(u) = a> 
[lel 3502 
such that 
(4.11) (enor = 1) da < co. 
R” 


If M is a compact manifold, possibly with boundary, of dimension n, and if u € 
H"/?(M), then there exists y = WM) /llellFpn/2¢00) such that 


(4.12) fener dV (x) < 00. 
M 


Proof. We have 


2 m 
u(a)|? “y Y m 
ee? 1 = glue)? + > lula)! + os+ 2 July" ++ 
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By (4.1), 
“yi 2m past m 2m 
(4.13) — f lula)" dV (a) < CL" — (2m)™ ull zpny2, 
Mm: mM: 


which is bounded by C’«”, for some & < 1, if 7 has the form (4.10), with 
Yn < 1/(2eC?), as can be seen via Stirling’s formula for m!. This proves the 
proposition. 


We note that the same argument involving (4.2)—-(4.8) also shows that, for any 
p € [2, 00), there is an € > 0 such that 


(4.14) H™/2-€(R") c LP(R"). 


Similarly, we have H"/?-*(M) Cc L?(M), when M is a compact manifold, per- 
haps with boundary, of dimension n. By virtue of Rellich’s theorem, we have for 
such M that the natural inclusion 


(4.15) t: H"/?(M) — L?(M) is compact, for all p < 00. 


Using this, we obtain the following result: 


Proposition 4.3. If M is a compact manifold (with boundary) of dimension 
n, a ER, then 


(4.16) uj —> u weakly in H”/?(M) => ei = e™ in L'(M)-norm. 
Proof. We have 
AU; au |o|” m m 
jes =| < SS [uso = fue). 
m>1 : 

If |[ujllan/2(ar) < A, we obtain 

| au; au < |o|™" : jpm-l m—-1 

et — ote < SO as — ulin ml ltaslt + lal 


m<k 


(4.17) cai 


m>k 
where we use 
[leuj|” — Jul™| < rrojey — el (Jeg + ful") 


to estimate the sum over m < k, and we use (4.1) to estimate the sum over m > k. 
By (4.15), for any k, the first sum on the right side of (4.17) goes to 0 as 7 — oo. 
Meanwhile the second sum vanishes as k — oo, so (4.16) follows. 
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Exercises 


1. Partially generalizing (4.10), let p € (1,00), and let u € H”?(R”), withkp =n, k € 
Z* . Show that there exists 7 = p(w) such that 
(4.18) erle(a) PFO 


|2i<R 


dx < Cor. 


For a more complete generalization, see Exercise 5 of § 6. 
Note: Finding the best constant y in (4.18) is subtle and has some important uses; see 
[Mos2], [Au], particularly for the case k = 1, p=n. 


5. Singular integral operators on L? 


One way the Fourier transform makes analysis on L?(IR”) easier than analysis 
on other L?-spaces is by the definitive result the Plancherel theorem gives as a 
condition that a convolution operator k*u = P(D)u be L?-bounded, namely that 
k(€) = P(€) be a bounded function of €. A replacement for this that advances 
our ability to pursue analysis on L” is the next result, established by S. Mikhlin, 
following related work for L?(T”) by J. Marcinkiewicz. 


Theorem 5.1. Suppose P(E) satisfies 

(5.1) ID°P(O| < Caley", 

for |a| <n-+1. Then 

(5.2) P(D) : L?(R") —> L?(R"), for1 <p<ov. 


Stronger results have been proved; one needs (5.1) only for |a| < [n/2]+1, and 
one can use certain L?-estimates on the derivatives of P(€). These sharper results 
can be found in [H1] and [S1]. Note that the characterization of P(€) € S)(R”) 
is that (5.1) hold for all a. 

The theorem stated above is a special case of a result that applies to pseu- 
dodifferential operators with symbols in se 5(IR”). As shown in § 2 of Chap. 7, if 
p(a,&) satisfies the estimates 


(5.3) | D2 De p(x, £)| < Cag(é)~!al+141, 
for 
(5.4) a}<1, jal<n+1+|6l, 


then the Schwartz kernel K(x, y) of P = p(x, D) satisfies the estimates 


(5.5) |K(x,y)| < Cla —y|™ 
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and 

(5.6) [VewK (x,y)| < Cla — yl". 

Furthermore, at least when 6 < 1, we have an L?-bound: 

(5.7) |Pullz2 < K|lullrz, 

and smoothings of such an operator have smooth Schwartz kernels satisfying 
(5.5)-(5.7) for fixed C, kK. (Results in §9 of this chapter will contain another 
proof of this L?-estimate. Note that when p(x, €) = p(€) the estimate (5.7) fol- 
lows from the Plancherel theorem.) Our main goal here is to give a proof of the 


following fundamental result of A. P. Calderon and A. Zygmund: 


Theorem 5.2. Suppose P : L?(R") — L?(R") is a weak limit of operators with 
smooth Schwartz kernels satisfying (5.5)-(5.7) uniformly. Then 


(5.8) P: L>(R") — L?(R"), 1<p<oo. 
In particular, this holds when P € OPS} ;(R"), 6 € [0,1). 

The hypotheses do not imply boundedness on L1(IR”) or on L®(IR”). They 
will imply that P is of weak type (1,1). By definition, an operator P is of weak 
type (q, q) provided that, for any A > 0, 

(5.9) meas {x : |Pu(x)| > A} < CA~4ull4,. 


Any bounded operator on L’ is a fortiori of weak type (q, q), in view of the simple 
inequality 


(5.10) meas {x : |u(x)| > A} < AT" Jull zs. 
A key ingredient in proving Theorem 5.2 is the following result: 
Proposition 5.3. Under the hypotheses of Theorem 5.2, P is of weak type (1,1). 


Once this is established, Theorem 5.2 will then follow from the next result, 
known as the Marcinkiewicz interpolation theorem. 


Proposition 5.4. [fr < p < q and if T is both of weak type (r,r) and of weak 
type (q,q), then T : L? + L?. 


Proof. Write u = wu; + ue, with u(x) = u(x) for |u(x)| > A and u(x) = u(x) 
for |u(a)| < A. With the notation 


(5.11) jog(A) = meas {x : | f(x)| > A}, 
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we have 


Mru(2A) < peru, (A) + brug (A) 
(5.12) < CYA ual + C2rA~4||ual|h0- 


Also, there is the formula 


fuer dx = rf pg (A)APt dd. 


Hence 
fivecaye ae =p [ pra(ayre aa 
0 
(5.13) < cp | aa | ju(x)|" de) dd 
: Jul >A 
+Cap | eal ju(a) [2 duc) dd. 
0 
Julia 
Now 
(5.14) i as | : lu(a)|" de) dd = | |x) i 
: Jul>a a 
ul> 


and, similarly, 
(5.15) [ arta / lu(a)|? de) dd = | |x) de, 
) q—p 


Combining these gives the desired estimate on ||T'u||7 


We will apply Proposition 5.4 in conjunction with the following covering 
lemma of Calderon and Zygmund: 


Lemma 5.5. Let u € L'(R") and \ > 0 be given. Then there exist v,wp € 
L1(R") and disjoint cubes Qy, 1 < k < 00, with centers x}, such that 


(5.16) u=vt > we, llollo +5 llweller < 3lullzs, 
k k 

(5.17) |u(a)| < 2"A, 

(5.18) fenlo) dx =0 and supp wp C Qk, 


Qk 


20 13. Function Space and Operator Theory for Nonlinear Analysis 
(5.19) S> meas(Qx) < A+ ull. 
k 


Proof. Tile R” with cubes of volume greater than \~!||u||;1. The mean value 
of |u(x)| over each such cube is < 4. Divide each of these cubes into 2” equal 
cubes, and let /11, [,2, /13,... be those so obtained over which the mean value of 


|w(a)| is > A. Note that 


(5.20) A meas(Ii,) < / \u(a)| dx < 2” meas(Iix). 
Tir 
Now set 
(5.21) — / (y) dy, fora € 1 
: UL) = meas(J1;.) Uy Y; x 1k, 
Nik 
and 
(5.22) wip(z) = u(x) — v(x), fora € Kz, 


0, fora ¢ Tix. 


Next take all the cubes that are not among the [;,, subdivide each into 2” equal 
parts, select those new cubes Jo, Ia2,..., over which the mean value of |u(a)| is 
> A, and extend the definitions (5.21)—(5.22) to these cubes, in the natural fashion. 
Continue in this way, obtaining disjoint cubes J; and functions w,,,. Then reorder 
these cubes and functions as Q1, Q2,..., and w 1, w2,.... Complete the definition 
of v by setting u(x) = u(x), for 2 ¢ UQx. Then we have the first part of (5.16). 
Since 


(5.23) foe + |wz(x)|) dx < 3/ |u(x)| da, 
Qk 


Qk 


and since the cubes are disjoint, w, is supported in Qz, and v = u on R” \ UQx, 
we obtain the rest of (5.16). 

Next, (5.17) follows from (5.20) if « € UQ,. But if « ¢ UQx, there are 
arbitrarily small cubes containing x over which the mean value of |u(x)| is < A, 
so (5.17) holds almost everywhere on R” \ UQ, as well. The assertion (5.18) is 
obvious from the construction, and (5.19) follows by summing (5.20). The lemma 
is proved. 


One thinks of v as the “good” piece and w = >~ wy, as the “bad” piece. What 
is “good” about v is that ||v||72 < 2”A|lul]z1, so 


(5.24) ||Pulli2 < Kllullis < 4"K?Allulza. 
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Hence 
A\?2 A 
(5.25) (5) meas « : |Pu(a)| > =} < CAlullz. 


To treat the action of P on the “bad” term w, we make use of the following es- 
sentially elementary estimate on the Schwartz kernel kK. The proof is an exercise. 


Lemma 5.6. There is a Cg < co such that, for any t > 0, if |y| < t,v9 € R”, 


(5.26) / | K(x, xo +y)—K(a, xo)| dz < Co. 


|w—a9|>2t 
To estimate Pw, we have 
Pun(e) = [ K(e,y)welu) dy 


= [Kew — K(a,xx)| wey) dy. 


Qk 


(5.27) 


Before we make further use of this, a little notation: Let Q;, be the cube concentric 
with Q;,, enlarged by a linear factor of 2n1/?, so meas Q = (4n)"/? meas Qy. 
For some t, > 0, we can arrange that 


C yx: |x — wp] Steg, 
(5.28) Qk e | ; | < te} 
Y, =R”"\ Qi C {x: |x — x,| > 2ty}. 
Furthermore, set O = UQ;, and note that 
(5.29) meas O < LAT" |lullz1, 


with L = (4n)"/?. Now, from (5.27), we have 


[ Penta) ae 
Yr 


(5.30) = / i |K (a+ cp, 0% +y) — K(x + 2x, 2%)| 
ly|<te, |v] >2t, 
‘|we(y + vx)| dx dy 
< Collwellz:, 


the last estimate using Lemma 5.6. Thus 


(5.31) / |Pw(a)| dx < 3Colulz:. 


R"\O 
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Together with (5.29), this gives 


r Ch 
(5.32) meas « :|Pw(x)| > =\ < 5 Welle, 


and this estimate together with (5.25) yields the desired weak (1,1)-estimate: 
C2 
(5.33) meas{x : |Pu(z)| >A} < 5 Helles 


This proves Proposition 5.3. 

To complete the proof of Theorem 5.2, we apply Marcinkiewicz interpolation 
to obtain (5.8) for p € (1, 2]. Note that the Schwartz kernel of P* also satisfies 
the hypotheses of Theorem 5.2, so we have P* : L? + L?, for 1 < p < 2. Thus 
the result (5.8) for p € [2, 00) follows by duality. 

We remark that if (5.6) is weakened to |V,, K (a, y)| < C|x—y|~"~", while the 
hypotheses (5.5) and (5.7) are retained, then Lemma 5.6 still holds, and hence so 
does Proposition 5.3. Thus, we still have P : L?(IR”) > L?(R”) for 1 < p < 2, 
but the duality argument gives only P* : L?(IR") > L?(IR") for2 <p<o. 

We next describe an important generalization to operators acting on Hilbert 
space-valued functions. Let H, and H2 be Hilbert spaces and suppose 


(5.34) P: L?(R", Hi) — L?(R", Ha). 

Then P has an £(H1, H2)-operator-valued Schwartz kernel A. Let us impose 
on the hypotheses of Theorem 5.2, where now |K(x,y)| stands for the 
L(Hi, H2)-norm of K(x, y). Then all the steps in the proof of Theorem 5.2 


extend to this case. Rather than formally state this general result, we will concen- 
trate on an important special case. 


Proposition 5.7. Let P(€) € C@(R”, L(H1, H2)) satisfy 

(5.35) IDE P(E)ll ccna) S Calé) "|, 

for all a > 0. Then 

(5.36)  P(D) : L?(R",H,) —> L?(R",Ha), forl <p<oo. 


This leads to an important circle of results known as Littlewood—Paley theory. 
To obtain this, start with a partition of unity 


(5.37) 1= >) 9; (6), 
j=0 


where y; € C'®, yo(€) is supported on |€| < 1, y1(€) is supported on 1/2 < 
IE] < 2, and yj (€) = yi (2' YE) for j > 2. We take Hy; = C, He = £”, and look 
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at 

(5.38) &:77(R")— F(R”, P) 

given by 

(5.39) ®(f) = (vo(D) Ff, 91(D) f, y2(D) Ff, ---)- 


This is clearly an isometry, though of course it is not surjective. The adjoint 


(5.40) &* : L7(R”, °) —> L?(R"), 
given by 

(5.41) ®* (go, 91,92,---) = > 9; (D) gy, 
satisfies 

(5.42) &* d=] 


on L?(R”). Note that ® = 6(D), where 


(5.43) &(€) = (po(€), 91(€), P2(€),- Sage 


It is easy to see that the hypothesis (5.35) is satisfied by both ®(€) and ®*(€). 
Hence, for 1 < p< o, 

& : L?(R") —> L?(R", ), 
(5.44) &* : L?(R", 2) —> L?(R"). 


In particular, ® maps L?(R”) isomorphically onto a closed subspace of 
L?(R”, ¢?), and we have compatibility of norms: 


(5.45) lullze © ||Gull r>cen ey. 


In other words, 
/ . 2 a2 
(5.46) Callin < |e besru?} | pe < Collullze, 
j= 


forl <p<o. 
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Exercises 


1. Estimate the family of symbols ay(€) = (€)’”, y € R. Show that if A” = ay(D), 
then 


(5.47) Aull zocmmy S Op(y)”* ull rex). 


This estimate will be useful for the development of the Sobolev spaces H*’? in the next 
section. 

2. Let wi(€) be supported on 1/4 < |é| < 4, di(€) = 1 for 1/2 < |€| < 2, and 
w;(€) = dn (2'-JE) for j > 2. Let s € R. Show that 


A(D), B(D) : L?(R”, 0’) — L?(R",£"), 1<p<o, 


for 


by applying Proposition 5.7. 

3. Give a proof that 

(5.48) [iter ac=p fo usar ar, 

0 

used in (5.13). Also, demonstrate (5.14) and (5.15). (Hint: After doing (5.48), get an 
analogous identity for the integral of | f(x)|? over the set {x : |f(ax)| > A}, resp., 
<A.) 

4. Give a detailed proof of Lemma 5.6. 

5. Let A € OPS} .9(R”), and suppose A(x, €) = 0 for x, = 0. Define Tf = AP) nes 
where Ri. = {x € R” : tay > O}. Show that, for 1 < p < co, 


(5.49) f¢L?(R"), supp f CRi => Tf € L?(R*). 


(Hint: Apply Proposition 5.1 of Appendix A. Compare with Exercise 3 in §5 of Ap- 
pendix A.) 


6. The spaces H*? 


Here we define and study H*? for any s € R, p € (1,00). In analogy with the 
characterization of H*(R") = H*?(IR”) given in § 1 of Chap. 4, we set 


(6.1) A? (R") = AF E?(R"). 
Given the results of § 5, we can establish the following. 


Proposition 6.1. When s = k is a positive integer, p € (1,00), the spaces 
H*-?(R”) of § 1 coincide with (6.1). 
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Proof. For |a| < k, €*(£)~* belongs to S?(IR”). Thus, by Theorem 5.1, D°A~* 
maps L?(R") to itself. Thus any u € A~*L?(IR") satisfies the definition of 
H*-?(IR”) given in § 1. For the converse, note that one can write 


(6.2) OPS S- amloe, 


Jal <k 


with coefficients gq. € S?(R”). Thus if D’u € L?(R”) for all |a| < k, it follows 
that A*u € L?(R”). 


We next prove an interpolation theorem generalizing the identity 
[L7(R"), H*(R”)], = H°*(R”), for 0 € (0, 1], 


proven in § 2 of Chap. 4. 
Proposition 6.2. For s € R, 6 € (0,1), and p € (1, 00), 


(6.3) [L?(R"), H°?(R”")], = H°*?(R"). 


Proof. The proof is parallel to that of Proposition 2.2 of Chap.4, except 
that we use the estimate (5.47) of the last section in place of the obvious 
identity ||A’’|| =1 for a unitary operator A’” on a Hilbert space. Thus, if 
v € HP (R”), let 


(6.4) u(z) = e® AO-2)8y, 


Then u(@) = ev, u(iy) = e7¥" A~#¥5 (A°%y) is bounded in L?(R"”), by 
(5.47), and also u(1 + iy) = e~ 4" A~sA~ts (A*°v) is bounded in the space 
H*?(R”). Therefore, such a function v belongs to the left side of (6.3). The 


reverse containment is similarly established as in the proof of Proposition 2.2 of 
Chap. 4. 


This sort of argument yields more generally that, for 7,s € R, 6 € (0,1), and 
p & (1,00), 


(6.5) [OP ) F(R") |p HPO ee (RF), 


With Proposition 6.2 established, we can define and analyze spaces H*? on 
compact manifolds in the same way as we did for p = 2 in Chap. 4. If IM is a com- 
pact manifold without boundary, one defines H*-?(1/) in analogy with H*(M), 
via coordinate charts, and proves 


(6.6) [H™?(M), H°?(M)]o = H°8tG-%)>P( uy), 


for p € (1,00), 6 € (0,1). If 2 is a compact subdomain of M with smooth 
boundary, we define H*:?(Q) as in §1, and recall the extension operator FE : 
H*-?(Q) + H*-?(M). If we define H*?(Q) for s > 0 by 


26 13. Function Space and Operator Theory for Nonlinear Analysis 
(6.7) H®?(Q) = [L?(Q), H®?(Q)]9, 9 € (0,1), s = kd, 
it follows that & : H*?(Q) — H*?(M) and hence 

(6.8) H*?(Q) © H*?(M)/{u:u=0onQ}. 


Also, of course, H*?(Q) agrees with the characterization of § 1 when s = kis a 
positive integer. Generalizing the theorem of Rellich, Proposition 4.4 of Chap. 4, 
one has, fors > 0, 1<p<o, 


(6.9) u: H8*+P(Q) — H*?(Q) is compact for o > 0. 


By the arguments used in Chap. 4, we easily reduce this to showing that, for 0 > 
Ol<p<om, 


(6.10) AW? : LP(T”) —> L?(T”) is compact. 


Indeed, the operator (6.10) is of the form A~°u = k, * u, with k, € L1(T") 
for any 0 > 0. Thus k, is an L'-norm limit of kg; € C°(T"), so A~? is an 
operator norm limit of convolution maps L?(T”) > C'%°(T”), which are clearly 
compact on L?(T”). 

We now extend some of the Sobolev imbedding theorems of § 2. Once they 
are obtained on R”, they easily yield similar results for functions on compact 
manifolds, perhaps with boundary. 


Proposition 6.3. If s > n/p, then H*?(R”) C C(R")N L®(R”). 
Proof. A~*u = J, * u, where J,(€) = (€)~*. It suffices to show that 


/ 1 1 
(6.11) Je € LP (R"), fors >, —-+—=1. 
P p 


/ 


Indeed, estimates established in §8 of Chap.3 imply that 7,(a) is smooth on 
R” \ 0, rapidly decreasing as |a| — 00, and 


(6.12) ols cel*, lela, sen, 


which is sufficient. Compare estimates for s = n/2 in (4.4)-(4.9). 


Next we generalize (2.9). 


Proposition 6.4. For sp <n, p € (1,00), we have 
(6.13) H®P(R") c LP/(n—sP)(R”), 


Proof. Suppose s = k+o, k € Zt, o € (0,1). Thenu € H*? > A?u ce 
H*-?, and by (2.9) this gives A7u € L4(R"), with g = np/(n — kp). Note that 
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q € (1,00) and np/(n — sp) = nq/(n — oq), so also og < n. Hence it suffices 
to show that 


(6.14) A214 ee -R, 

when o € (0,1), g € (1,00), and oq < n. We divide the analysis into cases. 
Case I: 1 < q <n. In this case, we have, by (2.2), 

(6.15) H}4(R") Cc L7/(™—9) (R"), 


Fixing v € L4(R"), consider A~*v for z € OQ = {z € C:0 < Rez < 1}. Note 
that Proposition 5.7 implies 


(6.16) Aol] n0 < Ae?! [ull ze, 

for y € R. Making use also of (6.15), we have 

(6.17) [ATOM oll pnarin—ay) S Ae?!" [oI] z9. 

From here a complex interpolation argument gives (6.14) in this case. 


Case IT: 2 <n <q < o. In this case, set r = nq/(n — og). Note that 


(6.18) = ad an eS, 


where r’ is the dual exponent to r. We have r > gq > n > 2, sor’ <2 <n, and 
Case I gives 


(6.19) AP oP (RY) 1 OR), 

Then (6.14) follows by duality. 

Case III: n = 1. Here one needs a different approach. Since this case is not so 
crucial for PDE, we omit it. Various proofs that include this case can be found in 
[S1], [S3], and [BL]. 

The following result is an immediate consequence of the definition (6.1), 
the pseudodifferential operator calculus, and the L?-boundedness result of 
Theorem 5.2. 

Proposition 6.5. [f P ¢ OPS";(R"), 0 <6 <1, and1 < p< ow, then 


(6.20) P: H*?(R") — He-™?(R"). 
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In view of the construction of parametrices for elliptic operators, we deduce 
various H**?-regularity results for solutions to linear elliptic equations. A se- 
quence of exercises on generalized div-curl lemmas given below will make use 
of this. 


Exercises 


1. Let y;(€)? = w;(€) be the partition of unity (5.37). Using the Littlewood—Paley esti- 
mates, show that, for p € (1,00), s ER, 


ae 1/2 
(6.21) [|u| zre.2¢any {S24 \ei(D)ul?} | 
k=0 


LP(R") 


(Hint: From (5.37), we have the left side of (6.21) 


(6.22) ~({ 1|A%5(D Jul}? 


LP(R)| 


Now apply Exercise 2 of § 5.) 
Exercises 2—4 lead up to a demonstration that if 


(6.23) Ue(E) = >> vel)’, 


e<k 


then, for s > 0, p € (1,00), 


ie 7 1/2 
(ary | 
k=0 e 


(6.24) > Ve(D) fel, Cop 
k=0 


2. Show that the left side of (6.24) is 


co 


=({Lleoval PY" L,, 


k= 


“HE Pocaly 
£=0 k= 


where fy = A” *ux. (Hint: Use arguments similar to those needed for Exercise 1.) 
3. Taking w, = 2** f,, argue that (6.24) follows given continuity of 


we T(D) : EP(R", 2) — L?(R",2), 
where 

Tre(€) = e(€)2- 8, for £ > k, 

eee 0, foré<k. 


4. Demonstrate the continuity (6.25), for p € (1,00), s > 0. 
(Hint: To apply Proposition 5.7, you need 


IDET (llc) < c,(é) 1, s>0. 


Exercises 


Obtain this by establishing 


>> Del ee(Q)| < Ce), 220, 
k 


and 


S~[DeT eel < C(O), 8 > 0.) 
e 
5. fue H”/?-P(R”), p € (1, 00), show that, for g € [p, co), 


\|2|| Lacan) < Cog? ll aadaatany: 
Deduce that, for some constant y = y(u) > 0, 
(6.27) {neue = 1) da < 00, 
Rn 


thus extending Trudinger’s estimate (4.10). See [Str]. 
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The purpose of the next exercise is to extend the Gagliardo—Nirenberg estimates 


(3.10) to nonintegral cases, namely 
(6.28) lull zpa.s7> S Cillullgs7@—ay + Collull patu.s/@+e)s 
given real p, s, A, and yu satisfying 
(6.29) l<p<ow,0<p<s-—p, and r€ (0,p). 
6. Establish the interpolation result 


(6.30) [E2/@-A) (R*), Er ths/ (PtH) (R")], Cc H™®s/?(R”), d= —_, 


under the hypotheses (6.29). Show that this implies (6.28). 


(Hint: If f = u(@) belongs to the left side of (6.30), with u(z) holomorphic, u(iy) and 
u(1 + iy) appropriately bounded, consider v(z) = A~*+")*u(z). Use the interpola- 


tion result 
yee gpitere)) —[s/? ga ) 
b é ? ba 


Can you treat the p = case, where L*/(?-*) = 1°? 
7. Extend (6.30) to Sobolev inclusions for [H*?, Hg. 


Exercises on generalized div-curl lemmas 


Let M be a compact, oriented Riemannian manifold, and assume that, for 7 = 1,... 


v €Z*, oj, are (;-forms on M, such that 
(6.31) Ojv —> 0; weakly in L?7(M), asv > oo, 
and 


(6.32) {doj, : v > 0} compact in H~'?3(M). 


Lk, 


30 13. Function Space and Operator Theory for Nonlinear Analysis 


Assume that 


1 1 
(6.33) pj € (1,00), —+---+—<1. 
Pl Pk 
The goal is to deduce that 
(6.34) Ow A+++ AN Ogy —* 01 A+**Aox in D'(M), 


as vy — oo. An exercise set in § 8 of Chap. 5 deals with the case k = 2, pi = po = 2, 
which includes the div-curl lemma of F. Murat [Mur]. As in that exercise set, we follow 
[RRT]. 

1. Show that you can write oj, = dajy + 3j., where aj, — aj weakly in H'?3(M) 
and {3;,} is compact in L?/ (1). (Hint: Use the Hodge decomposition o = déGo + 
ddGo + Po. Set aj, = dGojy.) 

2. Show that, for 7 < k, 


dary \+++ A dajy —> day \--+ A da; 
in D'(M). If p:~*+-+-+p;~' = qj;~* < 1, show that this convergence holds weakly 


in LY (M). 
(Hint: Use induction on j, via 


[da A+++ A daj+iy AG = + [ don, A+++ A dajy A aj+i,v A dy.) 


3. Now prove (6.34). (Hint: Expand (dai, + Biv) A+++ A (dag + Bev). For a term 


+(daeyy Ari: AN da,v) /\ ( Bapigs Aivee[N Be,v); 


establish and exploit weak L?-convergence of the first factor (if i < k) plus strong L” 
convergence of the second factor, with q~* + r~' < 1.) 

4. Localize the result (6.31)—(6.33) = (6.34), replacing M by an open set 2 C R”. 
(Hint: Apply a cutoff x € Cg? (Q).) 

5. (The div-curl lemma.) Let dim M = 3, and let X, and Y, be two sequences of vector 
fields such that 


X, — X weakly in L?', Y, — Y weakly in L??, 


F : —1 : =I; 
div X, compact in H~~’?!, curl Y, compact in H~ *’?? 
? ? 


where 1 < pj < ~, pit +p2' < 1. Show that X, -Y, — X-Y in D’. Formulate 
the analogue for dim M = 2. 
6. Let F, : R” — R” bea sequence of maps. Assume 


(6.35) F, — F weakly in H""(R"). 
Show that 
(6.36) det DF, — det DF inD’(R"). 


(Hint: Set oj, = daj, = Fy dx;.) 
More generally, if 2 < k < n and 
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(6.37) F, — F weakly in H'’*(R”), 
then 

(6.38) A‘ DF, — A‘ DF inD’(R”), 
and hence 

(6.39) Tr A“ DF, > Tr A*DF inD’(R”). 
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We will apply material developed in 88 5 and 6 to study spectral properties of the 
Laplace operator A on L?-spaces. We first consider A on L?(M), where M isa 
compact Riemannian manifold, without boundary. For any \ > 0, (A — A)~+ is 
bijective on D’(M), and results of §6 imply (A — A)~!:L?(M) — H??(M), 
provided 1 < p < oo. Thus if we define the unbounded operator A, on 
L?(M) to be A acting on H?-?(M), it follows that A, is a closed operator with 
nonempty resolvent set, and compact resolvent, hence a discrete spectrum, with 
finite-dimensional generalized eigenspaces. Elliptic regularity implies that each 
of these generalized eigenspaces consists of functions in C™(/), and then these 
functions are easily seen to be actual eigenfunctions. Thus, in such a case, the 
L?-spectrum of A coincides with its L?-spectrum. 

It is desirable to mention properties of A,, related to spectral properties. In 
particular, the heat semigroup e’“ defines a strongly continuous semigroup H, p(t) 
on L?(M), for each p € [1,00). For p € [2,00), this can be seen by applying 
the L?-theory, the maximum principle (for data in L°), and interpolating, to get 
H,(t) : L?(M) + L?(M), for p € [2, co]. Strong continuity for p < oo follows 
from denseness of C™ (A) in L?(M). Then the action of H,(t) as a semigroup 
on L?(M) for p € (1,2) follows by duality. One can also take the adjoint of the 
action of e'“ on C(M) to get e' acting on IN(M), the space of finite Borel mea- 
sures on M, and e‘“ then preserves L!(M), the closure of C°°(M) in N(M). 

Alternatively, the strongly continuous action of the heat semigroup on L?(M) 
for p € [1, 00) can be perceived directly from the parametrix for e’ constructed 
in Chap. 7, § 13. 

Let K be a closed cone in the right half-plane of C, with vertex at 0. Assume 
K is symmetric about the positive real axis and has angle a € (0,7). If P(z): 
X — X is a family of bounded operators on a Banach space X, for z € K, 
we say it is a holomorphic semigroup if it satisfies P(z,)P(z2) = P(z1 + 22) 
for z; € K, is strongly continuous in z € K, and is holomorphic in the interior, 


z € K. The strong continuity implies that || P(z)|| is locally uniformly bounded 
on K. 

Clearly, e’“ gives a holomorphic semigroup on L?(M). Also, e*4 f is defined 
in D'(M) whenever f € D’(M) and Re z > 0, and e*4 f € C®(M) when 
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Re z > 0. Also u(z,a) = e*4 f(x) is holomorphic in z in {Re z > 0}. This 
establishes all but one “small” point in the following. 


Proposition 7.1. e** defines a holomorphic semigroup H(z) on L?(M), for 
each p € [1, 00). 


Proof. Here, K can be any cone of the sort described above. It remains to establish 
strong continuity, H,(z)f — f in L?(M) as z > OinK, for any f € L?(M). 
Since C'°(M) is dense in L?(M), it suffices to prove that {H,(z) : z € K,|z| < 
1} has uniformly bounded operator norm on L?(M/). This can be done by check- 
ing that the parametrix construction for e’“ extends from t € Rt to z € K, 
yielding integral operators whose norms on L?(M) are readily bounded. The 
reader can check this. 


Since the heat semigroup on L?(Q) for a compact manifold with boundary has 
a parametrix of a form more complicated than it does on L?(M), this “small” 
point gets bigger when we extend Proposition 7.1 to the case of compact mani- 
folds with boundary. 

Here is a useful property of holomorphic semigroups. 


Proposition 7.2. Let P(z) be a holomorphic semigroup on a Banach space X, 
with generator A. Then 


(7.1) t>0, fe x => P(t)f € D(A) 
and 

C 
12) |AP(fllx < CIifllx, ford <t<1. 


Proof. For some a > 0, there is a circle y(t), centered at t, of radius alt|, such 
that y(t) € K, for all t € (0, 00). Thus 


03) APWF=POF=-5— f t- 0 PEOT de. 


y(t) 


Since || P(¢) f || < Coll f]| for ¢ € K, |C| < 1+ a, we have (7.2). 
In particular, we have that, for p € (1,00), O0<t<1, 


C 
(7.4) fe L(M) => lle fllz22an < FWFllzec, 


where C' = C,. This result could also be verified using the parametrix for ef. 


Note that applying interpolation to (7.4) yields 
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(7.5) Ile fll zs.2 a) < Cir tl? lf llzecn), for 0 <s< 2, O0<t < 1, 
when p € (1,00), C = C,. We will find it very useful to extend such an estimate 
to the case of e’ acting on L?(Q) when Q has a boundary. 
__ We now look at A onacompact Riemannian manifold with (smooth) boundary 
Q, with Dirichlet boundary condition. Assume 2 is connected and 0Q 4 @. We 
know that, for A > 0, 
(7.6) R, =(A— A)71: PQ) 3 19), 


with range H?(Q) 9 Hj (Q). We can analyze Ry f for f € L®(Q) by noting that 
Ry) is positivity preserving: 


(7.7) A>0, 9g >00nD = Ryg > 0onQ, 


a result that follows from the positivity property of e’“ and the resolvent formula. 
From this and regularity estimates on R,1, it easily follows that, for \ > 0, 


(7.8) Ry: C(Q) = C(Q) and Ry: L&(Q) > LQ). 
Taking the adjoint of Ry acting on C (Q), we have Ry acting on m(Q), the space 
of finite Borel measures on 2. Since the closure of L?(Q) in 9N(Q) is L'(Q), we 
have 
(7.9) Ry: L1(Q) > L'(Q). 
Interpolation yields 
(7.10) Ry: L?(Q) — TPQ), 1L<p<o. 

We next want to prove that 
(7.11) Ry : L?(Q) — H??(Q), p € (1,00), 
when > 0. To do this, it is convenient to assume that Q C M, where M is 
a compact Riemannian manifold without boundary, diffeomorphic to the double 
of 9. Let R : M > M be an involution that fixes OO and that, near OM, is the 
reflection of each geodesic normal to OQ about the point of intersection of the 
geodesic with 02. Then extend f to be 0 on M \ Q, defining f, and define v by 
(7.12) (A-A)v=f onM, 
so v € H*:?(M). Set u = Ry f. Take 


(7.13) u(x) = v(x) —v(R(z)), cen. 
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With v"(x) = v(R(x)), we have (L — A)v"(x) = f(R(a)), where L is the 
Laplace operator for R*g, the metric on M pulled back via R. Thus L = A+ L?, 
where L? is a differential operator of order 2, whose principal symbol vanishes on 
OQ. Thus u; € H?-?(Q), uy = 0 on OQ, and w; = u — uy satisfies 


(7.14) (A-A)jwi=rionQ, wi|,, = 9, 
with 
(7.15) ry =(A-A)v"|, = —L’o" |. 


It follows from (5.49) that 
(7.16) Ly" |, € HYP (Q) c LP(9), 


for some pz > p. If pz < ov, repeat the construction above, applying it to (7.14), 
to obtain 


(7.17) Wy =Ugt+W2, U2 € HQ), uala, = 0, 

and 

(7.18) (A-A)we=r20nQ, wel,,=0, re € HY??(Q) C LP5(Q). 

Continue, obtaining 

(7.19) Ustyte tut wpe, uy € H?"9(0), uzl,, = 0, 

such that 

(7.20) (A—A)wy=rj on, w;|,,=0, ry € BYP (Q) c LQ), 
We continue until p, > n = dim. At this point, we use a couple of results 

that will be established in the next section. Given s € (0, 1), let C*({) denote the 

space of Holder-continuous functions on (2, with Hélder exponent s. We have 

(7.21) ry € HYPr(Q) c C8), 


for some s € (0,1), appealing to Proposition 8.5 for the last inclusion in (7.21). 
Then the estimates in Theorem 8.9 imply 


(7.22) we € C2*9(Q) C H*P(Q). 


This proves (7.11). 
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Arguments parallel to those used for M show that the heat semigroup e“*, de- 
fined a priori on L?(Q), yields also a well-defined, strongly continuous semigroup 
H,(t) on L?(Q), for each p € [1,00). If A,, denotes the generator of the heat 
semigroup on L?(Q), with Dirichlet boundary condition, then (7.11) implies 


(7.23) D(A,) C H*?(Q), pe (1,00). 


We see that A,, has compact resolvent. Furthermore, arguments such as used 
above for M show that the spectrum of A,, coincides with the L?-spectrum of A. 
We now extend Proposition 7.1. 


Proposition 7.3. For p € (1,00), e** defines a holomorphic semigroup on 
L?(Q), on any symmetric cone K about R* of angle <n. 


Proof. As in the proof of Proposition 7.1, the point we need to establish is the 
local uniform boundedness of the L?(Q)-operator norm of e*4, for z € K. In 
other words, we need estimates for the solution u to 


Ou 
(7.24) a Auonk x, u0)=f, ulpveg =9, 
of the form 
(7.25) lu@)\lzoay < Cllfilze@y, teEkK, Ret <1. 


By duality, it suffices to do this for p € (1, 2]. The case p = 2 is obvious, so for the 
rest of the proof we will assume p € (1,2). We will also assume n = dim > 1, 
since the reflection principle works easily when n = 1. 

To begin, define v by 


(7.26) a =AvonKxM, v(0) =f € L?(M), 


where fis f on Q, zero on M \Q. Making use of Proposition 7.2, which we know 
applies to e’“ on L?(M), we have 


(7.27) I|v(t) |lzecany < Clt|*/? |Ifllze@)- 

Now, if R : M — M is the involution on M used above, for x € 2) we set 
(7.28) ui(t,v) = v(t, xz) — v(t, R(z)); us € C(K, L?(Q)). 

We have 


Ou 
(7.29) Sr = Au tg onk xO, mO=f, Ulexag =9, 
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and, by an argument parallel to (7.16), we derive from (7.27) an estimate 


(7.30) llg®)|lzeqay < Cle? II f ll ze (a)- 


In this case, we replace appeal to (5.49) by the parametrix construction for e’“ on 


D'(M) made in Chap. 7, § 13. 

We regard wu, as a first approximation to u, but we seek a more accurate ap- 
proximation rather than rely on an estimate at this point of the error. So now we 
define v2 by 


(7.31) oa =Avy—JonK x M, v9(0) =0, 


where g is g on K x Q and zero on K x (M \ Q). We have 


t 
(7.32) vo(t) = -{ e458) ds, 
0 


and the estimate ||9(s) || z>(a2) < C|s|~!/? from (7.30), together with the operator 
norm estimate of e‘~*)4 on L?(M), from Proposition 7.2, yields 


(7.33) v2 € C(K, H?(M)). 


Now, for x € Q, set 


(7.34) Ug(t, x) = v2(t, x) — v(t, R(x)); use € C(K, H*?(Q)). 
Thus 
Ou2 

(735) = Aw—gtg2onK xO, 20) =0, wlexog =9, 
and we have, parallel to but better than (7.30), 
(7.36) Ilg2(t)|ze(ay < Cll fll ze(ay- 

Next, solve 
(7.37) as = Av3 —g,onK x M,  v3(0) = 0, 


where g2 is gz on K x Q and zero on K x (M\Q). The argument involving (7.32) 
and (7.33) this time yields the better estimate 


(7.38) v3 € C(K, H?-*?(M)), Ve>0, 
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hence, by the Sobolev imbedding result of Proposition 6.4, with s = 1 —¢, 


np 


(7.39) v3 € C(K,H'”3(M)), p3 = S=25 > p, 
provided p < n. Now we set 
(7.40) u3(t, 7) = v3(t, x) — u3(t, R(a)); uz € C(K, H?3 (9), 
and we get 
Ou3 

(7.41) -— Au3 — g2+g3onK xQ, us(0)=0,  usleyag = 9; 
with the following improvement on (7.36): 
(7.42) l93(t)Ilb2s(ay < Cll fll zea). 

Continuing in this fashion, we get 
(7.43) uj € C(K, H?-©?i-1(9)) C C(K, H's (Q)), 


with p = po < p3 <--- /. Given p © (1,2), some px is > 2. Then uz € 
C(K, H'(Q)) satisfies 


Our 


(7.44) SF = Aun — ge-1 + 9n NK XO, uK(0)=0,  Urlexag = 9 
with 
(7.45) gx € C(K, L°(Q)). 
Now we solve for w the equation 
Ow 
(7.46) a Aw—g,onK\Q, w(0)=0, wWieyaq = 0: 


The easy L?-estimates yield 

(7.47) w € C(K, H?-*(Q)), 
and the solution to (7.24) is 

(7.48) US=Utes+URtw. 


This proves the desired estimate (7.25), for p € (1,2), which is enough to prove 
Proposition 7.3. 
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We mention that an interpolation argument yields that e* is a holomorphic 
semigroup on L?(Q) on a cone K that is symmetric about R* and has angle 
n(1 — |2/p — 1)). (See [RS], Vol. 2, p. 255.) This result is valid even if 2 has 
nasty boundary, as well as in other settings. On the other hand, ingredients of the 
argument used above will also be useful for other results, presented below. 

Note that once we have the holomorphy of e' on L?(Q), for all p € (1,00), 
we can apply Proposition 7.2. In particular, suppose we carry out the construction 
of the wu; above, not stopping as soon as pz > 2, but letting p; become arbitrarily 
large. Then (7.44) is replaced by gz, € C (K, [Pk (Q)), and we can now apply 
Proposition 7.2 to improve (7.47) to 


(7.49) w € C(K, H?-©Pk(9)), 


making use of (7.2), (7.11), and interpolation to estimate the norm of eA 
L?(Q) + H?-=P(Q). 

We now consider the construction (7.24)-(7.44) when u(0) = f € L%°(Q). We 
will restrict attention to t € R™. A direct inspection of the parametrix for the heat 
kernel, constructed in Chap. 7, § 13, shows that e’4 : L°(M) — C!(M), with 
norm < Ct~1/?, for t € (0, 1], so v in (7.26) satisfies the estimate ||u(t)||c1cw) < 
Ct-1/?|| f || (a), and || (t) || ¢1¢q) Satisfies a similar estimate. Thus g in (7.29) 
satisfies the estimate (7.30), with p = oo, and consequently v2 in (7.32) satisfies 
Ilv2(®) lox) < C. Hence ||u2(t)|[o1@) < C, and ge in (7.35) satisfies (7.36) 
with p = oo. Thus u = uy, + u2 + w, where w satisfies 


a 
(7.50) o* = Aw —g2onR*+ x, w(0)=0, 


aT 0. 


Wheexoa = 
By the holomorphy of e*“ on L?(Q) for p € (1, 00), we have 
(7.51) w € C((0, 00), H7-*?(Q)), 


for any ¢ > 0 and arbitrarily large p < 00, hence w € C(R*,C?-°(Q)), for any 
} > 0. We deduce that 


(7.52) lle“ flleag@ < Ct”? |Ifllzo@, O<t<1. 

The estimate (7.52), together with the following result, will be useful for the 
study of semilinear parabolic equations on domains with boundary, in §3 of 
Chap. 15. 

Proposition 7.4. If Q is a compact Riemannian manifold with boundary, on 


which the Dirichlet condition is placed, then e'* defines a strongly continuous 
semigroup on the Banach space 


(7.53) CO, (Q) = {f € C*Q) : flan = 0}. 
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Proof. It is easy to verify that, for N > 1+ (dim M)/4, 
D(A) c Ch (Q), densely. 


Since e'4 is a strongly continuous semigroup on D(A), it suffices to show that 


for each f € C}(Q), {e'4f : 0 < t < 1} is uniformly bounded in Lip(Q). To see 
this, we analyze solutions to 


a = Au, forxeQ, u(0,2) = f(x), u(t,x) =0, for x € OQ, 


when 
(7.54) free ®). gieo=% 


We will to some extent follow the proof of Proposition 7.3, and also use that result. 
In this case, for f equal to f on Q and to zero on M \ Q, we have f € Lip(M). 
Thus, for v defined by 


2 = AvonR* xM, v(0)=f, 


we have 
(7.55) v €C(R*, Lip(M)), 


where the “C” stands for “weak” continuity in t, (i.e., v(t) is bounded in Lip(M) 
and continuous in t, with values in H!*?(/), for each p < 00). Hence 


ui(t, x) = v(t, x) — u(t, R(2)) pect 


satisfies 
(7.56) ULE C(R*, Lip(Q)). 
We have 
UL 
oe =Am+t+g, w(0)=f, tin iene =; 
where 


_— yobyr 
g=Lv lees 


Here, as in (7.15), L is a second-order differential operator whose principal sym- 
bol vanishes on 0Q, and v"(x) = v(R(x)). Consequently, again an analogue of 
(5.49) gives 


(7.57) g €C(RT,L™(Q)). 
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Now, we have u = u; + w, where w satisfies 


(7.58) >, =Aw-g, w(0)=0, w r+xan = 9 


and, by (7.57), g € C(R*, L?(Q)), for all p < oo. This implies 


(7.59) we O(Rt, H?-©?(Q)), Vp < oo, e>0, 


A 


since e'“ is a holomorphic semigroup on L?(Q). This proves Proposition 7.4. 


Exercises 
1. Extend results of this section to the Neumann boundary condition. 


In Exercises 2 and 3, let 22 be an open subset, with smooth boundary, of a com- 
pact Riemannian manifold M. Assume there is an isometry 7 : M — M that is an 
involution, fixing OQ, so M is the isometric double of Q. 

2. Suppose X; are smooth vector fields on Q, f; € L?(Q) for some p € [2, 00), and u is 
the unique solution in Hj’? (Q) to 


Au => weer 


Show that u € H+?(Q). (Hint: Reduce to the case where each Xj is a smooth vector 
field on M, such that r4.X; = +X;. Extend f; to f; € L?(M), so that r* f; = = f;. 
Thus )> Xj f; € H~'?(M) is odd under 7.) 

3. Extend the result of Exercise 2 to the case f; € L?(Q) when 1 < p < 2, appropriately 
weakening the a priori hypothesis on w. 

4. Try to extend the results of Exercises 2 and 3 to general, compact, smooth Q, not nec- 
essarily having an isometric double. 

5. Show that (7.5) can be improved to 


Ry: L™(Q) — C(Q), 
for A > 0. (Hint: Use (7.11). Show that, in fact, for A > 0, 
Ry: L™(Q) + CQ), Vr <2.) 


A sharper result will be contained in (8.54)-(8.55). 


8. Hdélder spaces and Zygmund spaces 


If 0 < s < 1, we define the space C’*(R”) of Hélder-continuous functions on R” 
to consist of bounded functions w such that 


(8.1) |u(x + y) — u(x)| < Clyl*. 
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For k = 0,1,2,..., we take C*(IR”) to consist of bounded, continuous functions 
u such that D°u is bounded and continuous, for |G| < k. If s = k+r, 0 < 
r < 1, we define C*(IR") to consist of functions u € C*(IR”) such that, for 
|3| =k, D%u belongs to C’(R”). 

For nonintegral s, the Hélder spaces C'*(R”) have a characterization similar 
to that for LZ? and more generally H*:?, in (5.46) and (6.23), via the Littlewood— 
Paley partition of unity used in (5.37), 


i= Sey, 
j=0 
with :; supported on (€) ~ 27, and yj (€) = y1(2'%E) for j > 1. Let y;(€) = 
5 (E)?. 
Proposition 8.1. [fu € C*(R”), then 
(8.2) sup 2°*[[be(D)ullz= < 00. 


Proof. To see this, first note that it is obvious for s = 0. For s =  € Z*, it then 
follows from the elementary estimate 


C1 2""\|Ux,(D)u(x)||b~ < 2 I|~(D)D°u(z)|| L~ 
(8.3) ja|<é 
< C22" ||d,(D)u(z)||r~. 


Thus it suffices to establish that u € C’* implies (8.2) for 0 < s < 1. Since ¢1(2) 
has zero integral, we have, for k > 1, 


(8.4) 


which is readily bounded by C - 2-*°. 


This result has a partial converse. 
Proposition 8.2. [f s is not an integer, finiteness in (8.2) implies u € C*(R"). 


Proof. It suffices to demonstrate this for 0 < s < 1. With Wx (&) = D0 j<;, Wj (€); 
if |y| ~ 2-*, write 


u(ae + y) — (ae) = 7 y-VWq(D)u(a + ty) dt 


+ (I —W,(D)) (u(x + y) — u(z)) 


(8.5) 
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and use (8.2) and (8.3) to dominate the L°°-norm of both terms on the right by 
C -2-, since ||[V;,(D)ullz0 < C-20-9)*, 


This converse breaks down if s € Z*. We define the Zygmund space C3(R”) 
to consist of u such that (8.2) is finite, using that to define the C$-norm, namely, 


(8.6) llullos = oup 2" \Ib4(D)ull r=. 
Thus 
(8.7) Ci=C ifecRt\Z", C' cc’, kez. 


The class C'3(IR”) can be defined for any s € R, as the set of elements u € S’(R”) 
such that (8.6) is finite. 

The following complements previous boundedness results for Fourier multi- 
pliers P(D) on L?(R”) and on H*-?(R”). 


Proposition 8.3. If P(E) € Sj’(R”), then, for all s € R, 
(8.8) P(D): C2 — o8-™, 


Proof. Consider first the case m = 0. Pick w,(€) € C3°(IR”) such that 7); (€) = 1 
on supp wy; and 0; (€) = o1(2!-J€), for j > 2. It follows readily from the analysis 
of the Schwartz kernel of P(D) made in § 2 of Chap. 7, particularly in the proof 
of Proposition 2.2 there, that 


(8.9) P(é) € S?(R”) = sup ||¥5(6)P(O)llzz2 < 00, 
J 


where ||Q||-r1 = Olin. Also, it is clear that 
(8.10) lIYe(D)P(D)ullr~ < Clb. Plless - lve(D)ullr~, 


which implies (8.8) for m = 0. The extension to general m € R is straightfor- 
ward. 


In particular, with A = (1 — A)!/?, 
(8.11) A™ : Cf —> C3—™ is an isomorphism. 
Note that in light of (8.9) and (8.10), we have 


(8.12) — ||P(D)ullos < C sup | PO (€)(E)! [ne = [ful 


€ER”,a|<[n/2]+1 


Cs. 
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In particular, for y € R, 


(8.13) |A®ullos < Cly)”/?*? llullos. 


Compare with (5.47). 
The Sobolev imbedding theorem, Proposition 6.3, can be sharpened and ex- 
tended to the following: 


Proposition 8.4. For all s © R, p € (1,00), 


(8.14) H®?(R") C CT(R"), r=s-— = 


Proof. In light of (8.11), it suffices to consider the case s = n/p. Let Lm(E€) € 
Si’(IR”) be nowhere vanishing and satisfy L,,(€) = |&|, for || > 1/100. It 
suffices to show that, for p € (1,00), 


(8.15) Ilvs(D)L—n/p(D)ulln~ < Cllullze cer), 


with C' independent of &. We can restrict attention to k > 2. Then A;,(€) = 
Wr (€)L_n/p(E) satisfies 


Agyi(é) = 27%!” Ay (27*8). 
Hence Aji (x) € S(R”) and 
(8.16) || Ag+ill po’ (gny =C; independent of k > 2. 


Thus the left side of (8.15) is dominated by |Agll pe - ||u|| zp», which in turn is 
dominated by the right side of (8.15). This completes the proof. 


It is useful to extend Proposition 8.3 to the following. 
Proposition 8.5. If p(x, €) € Sj%o(IR”), then, for s € R, 
(8.17) p(a,.D) : C2(R") — Ce-™(R”). 
Proof. In light of (8.11), it suffices to consider the case m = 0. Also, it suffices 


to consider one fixed s, which we can take to be positive. First we prove (8.17) in 
the special case where p(x, €) has compact support in z. Then we can write 


(8.18) p(«, Du = eo Qn(D)u dn, 


with 
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(8.19) an(€) = (20) / e?-7 n(x,£) dr. 


Via the estimates used to prove Proposition 8.3, it follows that, for any given 
s € R,q,(D) € £(C8(R")) has an operator norm that is a rapidly decreasing 
function of 77. It is easy to establish the estimate 


(8.20) ile" | 


es SCs) (n)* |e 


Gs (s > 0), 


first for s ¢ Z*, by using the characterization (8.1) of C*’ = C$, then for general 
s > 0 by interpolation. The desired operator bound on (8.18) follows easily. 

To do the general case, one can use a partition of unity in the x-variables, of 
the form 


1= SY g(x), 9;(x) = yo(a +5), Yo € CH(R"), 
jean 


and exploit the estimates on p,(a,D)u = y;(x)p(x, D)u obtained by the ar- 
gument above, in concert with the rapid decrease of the Schwartz kernel of the 
operator p(a, D) away from the diagonal. Details are left to the reader. 


In § 9 we will establish a result that is somewhat stronger than Proposition 8.5, 
but this relatively simple result is already useful for Hélder estimates on solutions 
to linear, elliptic PDE. 

It is useful to note that we can define Zygmund spaces C'(T”) on the torus just 
as in (8.6), but using Fourier series. We again have (8.7) and Propositions 8.3—8.5. 

The issue of how Zygmund spaces form a complex interpolation scale is more 
subtle than the analogous situation for L?-Sobolev spaces, treated in § 6. A differ- 
ent type of complex interpolation functor, [X,Y], defined in Appendix A at the 
end of this chapter, does a better job than [X, Y]9. We have the following result 
established in Appendix A. 


Proposition 8.6. For r,s € R, 6 € (0,1), 
(8.21) CO, crm c=c' or, 


It is straightforward to extend the notions of Hélder and Zygmund spaces to 
spaces C*(M) and C$(M) when M is a compact manifold without boundary. 
Furthermore, the analogue of (8.14) is readily established, and we have 


(8.22) P:03(M) + C8-™(M) if P € OPST,(M). 


If Q is a compact manifold with boundary, there is an obvious notion of C*(Q), 
for s > 0. We will define C$(Q) below, for s > 0. For now we look further at 
C%(Q). The following simple observation is useful. Give 2. a Riemannian metric 
and let 6(”) = dist(#, 09). 
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Proposition 8.7. Let r € (0,1). Assume f € C'(Q) satisfies 
(8.23) \Vi(z)|<Cé(a)"", ren. 
Then f extends continuously to Q, as an element of C" (Q). 


Proof. There is no loss of generality in assuming that 2 is the unit ball in R”. 
When estimating f(x2) — f(x1), we may as well assume that 7 and a» are a 
distance < 1/4 from OQ. and |x, — r2| < 1/4. Write 


faa) — f(e1) = ; df («), 


where 7 is a path from x to x2 of the following sort. Let y; lie on the ray segment 
from 0 to xj, a distance d = |x, — x| from x;. Then + goes from x, to y; ona 
line, from y; to yz on a line, and from y2 to x2 on a line, as illustrated in Fig. 8.1. 
Then 


1 d 
(8.24) |f(a;)-fy)|<C f (-p)"*dp=C | dar oe 
1-d 0 


while 

(8.25) f(y) — F(y2)| < Clay — wld” * < Cd", 
so 

(8.26) |f (v2) — f(v1)| < Cla — 22)", 


as asserted. 


FIGURE 8.1 Path from 21 to x2 
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Now consider 2 of the form Q = [0,1] x M, where M is a compact 
Riemannian manifold without boundary. We want to consider the action on 
f € C"(M) of a family of operators of Poisson integral type, such as were studied 
in Chap. 7, § 12, to construct parametrices for regular elliptic boundary problems. 
We recall from (12.35) of Chap. 7 the class OPP~! consisting of families A(y) 
of pseudodifferential operators on /, parameterized by y € (0, 1]: 


(8.27)  A(y) € OPP? <=> y* Di A(y) bounded in OPS; 8 ***(M). 
Furthermore, if L € OPS1(M) is a positive, self-adjoint, elliptic operator, then 
operators of the form A(y)e~¥”, with A(y) € OPP~I, belong to OPP>!. In 
addition (see (12.50)), any A(y) € OPP; can be written in the form e~¥” B(y) 
for some such elliptic L and some B(y) € OPP~!. The following result is useful 
for Hélder estimates on solutions to elliptic boundary problems. 
Proposition 8.8. If A(y) € OPP! and f € C’(M), then 
(8.28) u(y, £) = A(y) f(x) => ue C7*"(I x M), 
provided j +r € Rt \ Zt. 

Note that we allow r < Oif 7 > 0. 


Proof. First consider the case 7 = 0, 0 < r < 1, and write 
(8.29) Aly) f =e"™* Biy)f, Bly) € OPP®. 


We can assume without loss of generality that A = (1—A) 1/2 and we can replace 
M by R”. In such a case, we will show that 


(8.30) |Vycu(y,2)| < Cy" |luller 


if 0 <r < 1, which by Proposition 8.7 will yield u € C’(I x M). Now if we set 
0; = 0/0x; for 1 <7 <n, 09 = O/Oy, then we can write 


(8.31) yOju(y,z) = yAe AB, (y)f, B,(y) € OPP®. 


Now, given f € C7™(M), 0 < r < 1, we have B;(y)f bounded in C’(M), for 
y € [0,1]. Then the estimate (8.30) follows from 


(8.32) leA)gliz~ < Cy" Ilgller, 


for 0 <r < 1, where y(A) = Ae~*, which vanishes at \ = 0 and is rapidly 
decreasing as \ —> +00. In turn, this follows easily from the characterization (8.6) 
of the Cf-norm. 
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If fe C**"(M), ke Z*+, 0<r <1, andj =0 then given |a| < k, 
(8.33) De u=e AB (y)A*f, Baly) € OPP®, 


so the analysis of (8.29), with f replaced by A¥* f, applies to yield Dy zt € CT(Ix 
M), for |a| < k. 

Similarly, the extension from j = 0 to general j € Z™ is straightforward, so 
Proposition 8.8 is proved. 


As we have said above, Proposition 8.8 is important because it yields Hélder 
estimates on solutions to elliptic boundary problems, as defined in Chap. 5, § 11. 
The principal consequence is the following: 


Theorem 8.9. Let (P,B;,1 < j < @) be a regular elliptic boundary problem. 
Suppose P has order m and each B; has order mj. If u solves 


(8.34) Pu=O0onQ, Bju=g; on dQ, 
then, forr € Rt \ Z*, 
(8.35) 93 € Ce "7 (0D) = we cr). 


Proof. Of course, u € C'°°(Q). On a collar neighborhood of OQ, diffeomorphic 
to [0, 1] x OQ, we can write, modulo C® ([0, 1] x OQ), 


(8.36) u=S°Q;(y)9;, Qj(y) € OPP2™, 


by Theorem 12.6 of Chap.7, so the implication (8.35) follows directly from 
(8.28). 


We next want to define Zygmund spaces on domains with boundary. Let 2 be 
an open set with smooth boundary (and closure Q) in a compact manifold M. We 
want to consider Zygmund spaces C7 (Q), r > 0. The approach we will take is to 
define C’ (Q) by interpolation: 


b 


(8.37) CLO) =|" (),07 OQ), 


where 0 < 51 <r < 52,0<0<1, r= (1—6)s) + O82 (and s; ¢ Z). As in 
(8.21), we are using the complex interpolation functor defined in Appendix A. We 
need to show that this is independent of choices of such s;. Using an argument 
parallel to one in § 6, for any N € Z*, we have an extension operator 


(8.38) E:C*(Q) —+C*(M), s€(0,N)\Z, 
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providing a right inverse for the surjective restriction operator 
(8.39) p: C*(M) —> C*(Q). 


From Proposition 8.6, we can deduce that whenever r > 0 and s; and 6 are 
as above, C7(M) = [C*'(M), C#(M)]5. Thus, by interpolation, we have, for 
r > 0, 


(8.40) E:Cl(Q) — Cf(M),  p: Ci(M) > C7), 
and pE = I on C’(). Hence 
(8.41) Cl (Q) © CT(M)/{u € Cl(M) : ul, = 0}. 


This characterization is manifestly independent of the choices made in (8.37). 
Note that the right side of (8.41) is meaningful even for r < 0. 

By Propositions 8.1 and 8.2, we know that C?(M/) = C"(M), forr € Rt\Z*, 
so 


(8.42) CL(Q) = C™(Q), forre Rt\ Zt. 


Using the spaces C’(), we can fill in the gaps (at r € Z*) in the estimates of 
Theorem 8.9. 


Proposition 8.10. [f (P,B;,1 < 7 < £) is a regular elliptic boundary problem 
as in Theorem 8.9 and u solves (8.34), then, for all r € (0,00), 


(8.43) 93 € Ce "7 (00) = we CTO). 


Proof. For r € Rt \Z*, this is equivalent to (8.35). Since the solution u is given, 
mod C™(Q), by the operator (8.36), the rest follows by interpolation. 


In a sense, the C2-norm is only a tad weaker than the C°-norm. The following 
is a quantitative version of this statement, which will prove very useful for the 
study of nonlinear evolution equations, particularly in Chap. 17. 


Proposition 8.11. [fs > n/2 + 6, then there is C < co such that, for all € € 
(0, 1], 


1 
(8.44) lla < Ce*jullire + C(log =) lulls. 


Proof. By (8.6), ||u||co = sup ||Px(D)ul|r~. Now, with Vj = >7,<; Ye, make 
the decomposition u = U;(D)u+ (1 — U;(D))u; let = 2~/. Clearly, 


(8.45) |W (D)ulln~ < Jllulles. 
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Meanwhile, using the Sobolev imbedding theorem, since n/2 < s — 6, 


|(1 — 8; (D) )ullr~ < Cl] — ¥j(D))ullas—s 
()) < C2 |\(1—¥;(D))ullzs, 


the last identity holding since {2/°(€)~°(1 — W,(€)) : j € Zt} is uniformly 
bounded. This proves (8.44). 


Suppose the norms satisfy ||u||co < Cllul|z. If we substitute ©? = 


Co1|\ul|co/||u|zs into (8.44), we obtain the estimate (for a new C = C(6d)) 


(8.47) lull < Cllullce 1 ‘Sieg (ae) | 
lullco 


We note that a number of variants of (8.44) and (8.47) hold. For some of them, 
it is useful to strengthen the last observation in the proof above to 


(8.48) {27°(€)-° (1 —W,(€)) : 7 € Z*} is bounded in S?(R”). 


An argument parallel to the proof of Proposition 8.11 gives estimates 


1 
(849) |lulloxqar) < Ce*llullerecan) + C (log =) llullozcan, 


givenk € Zt, s > n/2+k+ 0, and consequently 


ella 
1+1 
ilies (re 


when M is a compact manifold without boundary. __ _ 

We can also establish such an estimate for the C*(Q)-norm when 2 is a com- 
pact manifold with boundary. If 2 C M as above, this follows easily from (8.50), 
via: 


(8.50) ulloecmy < Cllulleecy 


Lemma 8.12. For any r € (0, N), 
(8.51) lullor@) © ||Eullerumy- 


Proof. If Eu; > v in C?(M), then pEu; — pv in C7(Q), that is, u; 4 pv in 


C2 (Q), since pEu; = u;. Thus v = Epv, in this case. This proves the lemma, 


which is also equivalent to the statement that & in (8.40) has closed range. 


We also have such a result for Sobolev spaces: 


(8.52) [Ullare(ay) © ||Lullareay, 1<p<o. 
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Thus (8.50) yields 


llullars(o 
(8.53) Ullon @) < Cllullce@) c be ( © , 


Ilull cw @) 


provided s > n/2+k. 


Exercises 
1. Extend the estimates of Theorem 8.9 and Proposition 8.10 to solutions of 
(8.54) Pu=fonQ, Bju=g; ondQ. 
Show that, for r € (44,00), 4 = max(m;), 
(8.55) fea "@), 93 € Ce "4 (82) => we CFO). 


Note that we allow r — m < 0, in which case C7~™ (Q) is defined by the right side of 
(8.41) (with r replaced by r — m). 
2. Establish the following result, similar to (8.44): 


1\ 1-1/4 
(8.56) Ilullnco < Ce>|lullzs.r + C (log ) | 


[ell n/a.a> 


where s > n/p +6, q € [2,00), and a similar estimate for q € (1, 2], using 
(log 1/e)'/7. (See [BrG] and [BrW].) 
Hint. Establish appropriate analogues of the estimates (8.45) and (8.46). 

3. From (8.15) it follows that H'?(R") C C"(R”) if p > n, r = 1—n/p. Demonstrate 
the following more precise result: 


(8.57) |u(a) — u(y)| < Cle —yl*"/? |Vullzea.y), P >, 
where Bry = Biz—y|(©) OM Biz—y\(y)- 
(Hint: Apply scaling to (2.16) to obtain 
|u(re1) — v(0)| < Cr?™ / |Vu(a)|? dx. 
Br (0) 


To pass from B),_|(x) to Bry in (8.57), note what the support of ¢ is in Exercise 5 of 
§ 2.) There is a stronger estimate, known as Morrey’s inequality. See Chap. 14 for more 
on this. 


9. Pseudodifferential operators with nonregular symbols 


We establish here some results on Hélder and Sobolev space continuity for pseu- 
dodifferential operators p(x, D) with symbols p(a,&) that are somewhat more 
ill behaved than those for which we had L?-Sobolev estimates in Chap.7 or 
L”-Sobolev estimates and Hdélder estimates in §§5 and 8 of this chapter. These 
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results will be very useful in the analysis of nonlinear, elliptic PDE in Chap. 14 
and will also be used in Chaps. 15 and 16. 
Let r € (0, 00). We say p(x, €) € C7S7";(R”) provided 


(9.1) |DEp(a, €)| < Caley” a! 
and 
(9.2) | D2p(-, llorcany < Caleym latter, 


Here 6 € [0,1]. The following rather strong result is due to G. Bourdaud [Bou], 
following work of E. Stein [S2]. 


Theorem 9.1. [fr > 0 and p € (1,00), then, for p(x, §) € CL S74, 
(9.3) p(x, D): Het? —, H*?, 
provided 0 < s <r. Furthermore, under these hypotheses, 
(9.4) p(x, D) : C8+™ —> C8. 

Before giving the proof of this result, we record some implications. Note 
that any p(a,€) € S7", satisfies the hypotheses for all r > 0. Since operators 
in OPS{"; possess good multiplicative properties for 6 € [0,1), we have the 


following: 


Corollary 9.2. If p(x,£) € Sj";, 0 < 6 < 1, we have the mapping properties 
(9.3) and (9.4) for all s € R. 


It is known that elements of OPS? , need not be bounded on L?, even for 
p = 2, but by duality and interpolation we have the following: 


Corollary 9.3. If p(x, D) and p(x, D)* belong to OPS7", then (9.3) holds for 
alls ER. 


We prepare to prove Theorem 9.1. It suffices to treat the case m = 0. Following 
[Bou] and also [Ma2], we make use of the following results from Littlewood— 
Paley theory. These results follow from (6.23) and (6.25), respectively. 

Lemma 9.4. Let fi, € S’(R”) be such that, for some A > 0, 
(9.5) supp fC (ev Aso a ele Ao, bel. 


Say fo has compact support. Then, for p € (1,00), s € R, we have 
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oo eal, sclfScearyL, 


Tf fx = v~(D)f with yp, supported in the shell defined in (9.5) and bounded in 
SY o, then the converse of the estimate (9.6) also holds. 


Lemma 9.5. Let f;, € S’(R”) be such that 
(9.7) supp fr C {€:|E|< A-2*}, k>0. 


Then, for p € (1,00), s > 0, we have 


(9.8) Doe <ol{d ysl mi, 


The next ingredient is a symbol decomposition. We begin with the Littlewood— 
Paley partition of unity (5.37), 


(9.9) 1=>) (6? => HO 

and with 

(9.10) p(z,€) = >> (2, 6)o5(6) = S> pj(2, €). 
j=0 j=0 


Now, let us take a basis of L*(|€;| < 7) of the form 


€a(€) = eis, 


and write (for 7 > 1) 


(9.11) p;(2,€) = >" pja(z)ea(2-FE)¥7 (2), 


where #7 (€) has support on 1/2 < |é| < 2 and is 1 on supp 71, pi (€) = 


wir (29+), with an analogous decomposition for po(€). Inserting these decom- 
positions into (9.10) and summing over j, we obtain p(x, €) as a sum of a rapidly 
decreasing sequence of elementary symbols. 

By definition, an elementary symbol in C7'S : 5 18 of the form 


(9.12) a(x, €) = 9° Qe(x) yx) 
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where y, is supported on (€) ~ 2* and bounded in S?—in fact, yp(€) = 
y1(2-*t1€), for k > 2—and Q;() satisfies 


(9.13) lQx(x)1 < C, []Qallor < C-2*"%. 


For the purpose of proving Theorem 9.1, we take 6 = 1. It suffices to estimate the 
H”-operator norm of q(a, D) when q(x, €) is such an elementary symbol. 

Set Qi; (x) = Yj (D)Qz(x), with {y,;} the partition of unity described in 
(9.9). Set 


k—-4 k+3 co 
g(z,é) = >> Qej(z) + D5 Qeg(z) + SD) Qes(w) ¢ vel) 
k j=0 j=k-3 joHkt+4 
(9.14) = q(x, €) + qa(z, €) + q3(a, €). 


We will perform separate estimates of these three pieces. Set f, = y,(D)f. 
First we estimate gi (a, D) f. By Lemma 9.4, since (€) ~ 2/ on the spectrum 
of Qrj, 


1/2 
co k—-4 
ie 2 
llau(@, D) fll» < Cl, S- 4" |S" Qua fe | : 
k=4 — j=0 
& 1/2 
(0.15) <C {dariaut-LaP| |. 
k=4 
< C f\las, 


for all s € R. 
To estimate go(x, D)f, note that ||Qxj|| n° < C+ 2-/"**". Then Lemma 9.5 
implies 


ag 1/2 
(9.16) Iga, D) fll» < of Sahin | pe < Cl fllaree, 
k=0 


for s > 0. 
To estimate q3(x, D) f, we apply Lemma 9.4 to h; = S| Qxj fx, to obtain 


1/2 
fore) j-4 
: ; 
ase, D) fila» < Cl] S04 |S Oeste” pI, 
j= k=0 
- J 2\ 1/2 
(9.17) < c| sae > 214 | 

j= k=0 
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Now, if we set g; = SS 2(k-J)"| f,,| and then set G; = 2/°g; and F; = 2/°|f; 
we see that 


> 


j-4 
G; = S- gitar a) a. 
k=0 


As long as r > s, Young’s inequality (see Exercise | at the end of this section) 
yields ||(G;)||22 < C||(F5) ||z2, so the last line in (9.17) is bounded by 


1/2 


cl raise | < Clif. 
j=0 


This proves (9.3). 
The proof of (9.4) is similar. We replace (9.6) by 


(9.18) IIf llor ~ sup 2** IIb. (D)fllz~, 7 > 0. 


We also need an analogue of Lemma 9.5: 


Lemma 9.6. If fz € S!(IR”) and supp fe C {€ : \E| < A-2**}, then, for r > 0, 


(9.19) Du sel 
k=0 


or < Csup 2°" || fxll ze. 
* k>0 


Proof. For some finite N, we have )j(D) iso fe = ¥j(D) esj—w See Sup- 
pose sup, 2°"|| fx||z00 = S. Then 


[e(D) 7 fell SS SD ar s ctge-®. 
k>0 


k>j—-N 
This proves (9.19). 


Now, to prove (9.4), as before it suffices to consider elementary symbols, of the 
form (9.12)-(9.13), and we use again the decomposition q(z,€) = qi + q2 + 43 
of (9.14). Thus it remains to obtain analogues of the estimates (9.15)—(9.17). 

Parallel to (9.15), using the fact that oe Qxj(x) fx has spectrum in the shell 
(€) ~ 2*, and ||Qz||z~ < C, we obtain 


k-4 


lln(«, D) filo: < Csup 2**||S> Qej fell. 
k>0 j=0 


(9.20) < Csup 2**|| felln~ 
k>0 


< Cllfiies; 
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for all s € R. Parallel to (9.16), using ||Qx;||n-2 < C -2-9"+*" and Lemma 9.6, 
we have 


Ilq2(x, D)f| 


foe) 
oc: S do glk 
k=0 


(9.21) < Csup 2"*|\ge\|L~ 
k>0 


< Csup 2"*||frlln~ < Clifiles, 
k>0 


for all s > 0, where the sum of seven terms 


k+3 


s Cu@h 


j=k—3 


has spectrum contained in |€| < C’- 2, and ||g9x||L2 < Cl fell z<- 
Finally, parallel to (9.17), since pa ms Qxj fx has spectrum in the shell (€) ~ 
2), we have 


j-4 


llas(@, D) filo: < Csup 2'*|| > Qa fall poo 
720 k=0 


(9.22) < Csup 20-7) 2 Millon. 
j20 k=0 


If we bound this last sum by 


j-4 
(9.23) bb are ®) sup 2" fill zo 
k=0 
then 
j-4 
9.24 laste, D)flles < C [sup 2-) F72ke-9| 
k=0 


and the factor in brackets is finite as long as s < r. The proof of Theorem 9.1 is 
complete. 

Things barely blow up in (9.24) when s = r. We will establish the following 
result here. A sharper result (for p(a,€) € C7 Sy" with 6 < 1) is given in (9.43). 


56 13. Function Space and Operator Theory for Nonlinear Analysis 
Proposition 9.7. Take r > 0. If p(a,€) © CLSi%, then 
(9.25) p(x, D): CTtTT® —+ CT, foraile > 0. 


Proof. It suffices to treat the case m = 0. We follow the proof of (9.4). The 
estimates (9.20) and (9.21) continue to work; (9.22) yields 


j-4 
llas(x, D) flor S$ Csup $7 2"" || fellz~ 
TE) k=0 
(9.26) = CD02" ll fella 
k=0 
ra cy. gkr, 7 aad | ort, 
k=0 


which proves (9.25). 


The way symbols in C’.S7"; most frequently arise is the following. One has in 
hand a symbol p(x,€) € C{S7", such as the symbol of a differential operator, 
with Hélder-continuous coefficients. One is then motivated to decompose p(x, €) 
as a sum 


(9.27) p(x, €) = p* (a, €) + p(2, 6), 


where p* (x, £) € 5» for some 6 € (0, 1), and there is a good operator calculus 
for p* (x, D), while p?(x, €) € CT Si’ ; (for some ps < m) is treated as a remainder 
term, to be estimated. We will refer to this construction as symbol smoothing. 

The symbol decomposition (9.27) is constructed as follows. Use the partition 
of unity ~;(€) of (9.9). Given p(x, €) € C7 ST", choose 6 € (0, 1] and set 


(9.28) pr (at) = >" di pla eds), 
j=0 


where J. is a smoothing operator on functions of x, namely 

(9.29) Jef(x) = b(eD)f (2), 

with @ € C>°(R”), o(€) = 1 for |é| < 1 (e.g., 6 = Vo), and we take 
(9.30) eo, 


We then define p? (a, ) to be p(x, €) — p* (2, €), yielding (9.27). 
To analyze these terms, we use the following simple result. 
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Lemma 9.8. For € (0, 1], 


(9.31) |DoJefllos < Cg Fl | 


Cy 
and 
(9.32) lf — Jefllos-t < Ce*|Ifllez, fort > 0. 
Furthermore, if s > 0, 
(9.33) lf — Jefllb~ < Cse*|fllcs- 


Proof. The estimate (9.31) follows from the fact that, for each 3 > 0, 
e!"|D% (ED) is bounded in OPS} o, 
and the estimate (9.32) follows from the fact that, with A = (1 — A)!/?, 
At : CS —+ C5“ isomorphically, 
plus the fact that 
e—'A~*(1 — $(eD)) is bounded in OPS} , 
for 0 < e <1. As for (9.33), if e ~ 2-4, we have 


(1 — d(€D)) flln~ < So llve(D)flla= < CD) 2-* ||| 


l25 l25 


Cs; 


and since Vey 2-* < C,2-7* for s > 0, (9.33) follows. 


Using this, we easily derive the following conclusion: 


Proposition 9.9. Take r > 0 and 6 € (0,1). If p(a,€) © CLS{"%, then, in the 
decomposition (9.27), 


(9.34) p* (a,£) € ST 
and 
(9.35) p'(a,é) € TST. 


Proof. The estimate (9.31) yields 
(9.36) IDE DEp* (-, Eller S Capley™ lal", 


which implies (9.34). 
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That p” (zx, €) satisfies an estimate of the form (9.2), with m replaced by m—ro, 
follows from (9.32), with ¢ = 0. That it satisfies (9.1), with m replaced by m—r6, 
is a consequence of the estimate (9.33). 


REMARK. See the exercises for an extension of (9.34) to the case p(z,&) € 
LL Si". 


It will also be useful to smooth out a symbol p(x, €) € C7S1";, for d € (0, 1). 
Pick y € (6,1), and apply (9.28), with e; = 2-90-9), obtaining p# (a, €) and 
hence a decomposition of the form (9.27). In this case, we obtain 


(9.37) p(a,£) € CLS => P#(a,€) € ST, p(a,8) € Crs”. 


We use the symbol decomposition (9.27) to establish the following variant of 
Theorem 9.1, which will be most useful in Chap. 14. 


Proposition 9.10. fd € [0,1) and p(x, €) € CYST"s, then 


p(x, D) : H8t™P —, H*?, 
(9.38) a 
p(x, D): CO" — CG, 
provided p € (1,00) and 
(9.39) -—(1l-d)r<s<r. 


Proof. The result follows directly from Theorem 9.1 if 0 < s < r, so it remains 
to consider s € (-(1 — d)r, 0] . Use the decomposition (9.27), p = p* + p?, with 
(9.37) holding. Thus p* (x, D) has the mapping property (9.38) for all s € R. 
Applying Theorem 9.1 to p?(x, D) yields mapping properties such as 


p’(a,D): H7t™-O-9)rp _, HP, g > 0, 
or, setting s = 0 — (y—4d)r, 
p(x, D): Herm? —_, Hero -har & H’?, s>—-(y—-9)r, 
and similar results on C3*+™. Then letting y /7 1 completes the proof of (9.38). 


Recall that, for r € (0,00), we have defined p(x, £) to belong to the space 
CyS7";(IR") provided the estimates (9.1) and (9.2) hold. If r € [0, 00), we will 
say that p(x,€) € C™S7";(IR") provided that (9.1)-(9.2) hold and, additionally, 


(9.40) [DE 0(-, Hlesmry) < Ca (E™ IN, O<7j<r, jez. 
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In particular, we make a semantic distinction between CS}; and C'"'S7"5 even 
when r ¢ Z*, in which cases C’ and C” coincide. The differences between the 
two symbol classes are minor, especially when r ¢ Z, but natural examples of 
symbols often do have this additional property, and we sometimes use the symbol 
classes just defined to record this fact. 


Exercises 
1. Young’s inequality implies 
If * gllea < WFlle Ilglles, 


where f = (f;), 9 = (gj), and (f * 9); = >o, f;—ng%. Show how this applies (with 
qd = 2) to the estimate of (9.17). 
2. Supplement Lemma 9.8 with the estimates 


|DeJefllz~ < Cliflles, Il <s, 
(9.41) Ce IIflloz, Bl > 8, 
given s > 0. 
3. Show that if p(x, €) € C2Si" has the decomposition (9.27), then 
Dip* (#,£) € St, for |8| <r, 
(9.42) oe for |B] > r. 


4. Strengthen part of Proposition 9.10 to obtain, for 6 € [0,1), r > 0, 
(9.43) plat) € Cl STs => p(a, D): C27" — CE, for —(1-—6)r <8 <r. 
(Hint: Apply Proposition 9.7 to p? (x, D), arising in (9.37).) 


5. Given s € R, 1 < p,q < ov, we say f € S’(R”) belongs to the Triebel space 
F;,q(R”) provided 


0.44) IFlleg.¢ = |{2°e(D)F}pecen.en) <2 
where {,} is the partition of unity (9.9). Note that F>5 = H°? if1 < p < ow, 


by Lemma 9.4. Also, we say that f € S’(R”) belongs to the Besov space B5 ,(R”) 
provided 


(9.45) IIflles., = [besa C2)) 24 Paremreren Se: 


Note that BS... = C. Also, B§.» = H'®, since (?(L?(IR”)) = L?(R”, £?). 
Extend Theorem 9.1 to results of the form 


p(z,D): F3t" > FS,, p(a,D): Bey” — Be. 


(See [Mal].) 
6. We define the symbol class CY S77 to consist of p(x, €) € Cy ST" such that 


(9.46) p(a,€) ~ >> p; (a, €) 


j20 
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where p; (a, €) € Cy Si" is homogeneous of degree m — j in €, for || > 1, and (9.46) 
means that the difference between the left side and the sum over 0 < 7 < N belongs 
to Coa If r € R* \ Z*, we also denote the symbol class by C’S%”’. Show that 
estimates of the form (9.3) and (9.4) have simpler proofs in this case, derived from 
expansions of the form 


(9.47) pi(,€) = >> py (a) ||" Fur (él *); 


for |€| > 1, where {w,} is an orthonormal basis of L?(S”~") consisting of eigenfunc- 
tions of the Laplace operator. 
7. Take 6 € (0, 1] and assume p(x,€) € L° Sf", so 


sup |Dép(«,€)| < Ca(é) |. 
Show that p* (a, €), defined by (9.28)-(9.30), satisfies p# (a, €) € STs. 
Hint. Replace (9.31) by the estimate 

|| DE Je fllze < Cae "I fllze, 


This estimate is known as Bernstein’s inequality. Prove it. . 


10. Paradifferential operators 


Here we develop the paradifferential operator calculus, introduced by J.-M. Bony 
in [Bon]. We begin with Y. Meyer’s ingenious formula for F'(u) as M(x, D)u+R 
where F' is smooth in its argument(s), u belongs to a Hélder or Sobolev space, 
M (a, D) is a pseudodifferential operator of type (1,1), and R is smooth. From 
there, one applies symbol smoothing to M/(a,£) and makes use of results estab- 
lished in § 9. 

Following [Mey], we discuss the connection between F'(u), for smooth non- 
linear F’, and the action on u of certain pseudodifferential operators of type (1, 1). 
Let w;(€) = yj (€)? be the Littlewood-Paley partition of unity (5.37), and set 
Wil) = Dijcn Ys (€)- Given u (e.g., in C"(R")), set 


(10.1) up = VU, (D)u, 

and write 

(10.2) F(u) = F(uo) + [F(u1) — F(uo)] +--+ + [F(ueq1) — F(ua)] +2 - 
Then write 


F(up41) — F(up) = F (ur + veri (D)u) — F(ur) 
(10.3) = m()Ve41(D)u, 
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where 


(10.4) mp(x) =| F" (WU; (D)u + te41(D)u) dt. 


Consequently, we have 


F(u) = F(uo) )+ 2 mala )be+i(D)u 


(10.5) = M(c, Dyu- i F (uo), 

where 

(10.6) -> mx (x)br+1(€) = Mr(u; 2, §). 
We claim 

(10.7) M(z,£) € 5}, 


provided wu is continuous. To estimate M(x, €), note first that by (10.4) 
(10.8) Ilmxll ze < sup |F’(A)]- 


To estimate higher derivatives, we use the elementary estimate 


(10.9) ||D*g(h) nx <C SS Ig! Yew-al] D“ Al|nx ++ ||D™ All z= 


fi te-+h,<e 


to obtain 
(10.10) || Dimellz~© < Coll F’"\Ice-1 (lull) - 2°, 


granted the following estimates, which hold for all u € L°: 


(10.11) || V,(D)u + tupa1(D)ull po < C]lullz< 
and 
(10.12) || D°[W,(D)u + tue41(D)ull| p02 < Cp2**||ul] p 


for t € [0,1]. Consequently, (10.6) yields 


(10.13) |D¢M (a, €)| < Ca sup |F’(A)|(6)7!*! 
aN 
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and, for || > 1, 

(10.14) |DEDEM(«, €)| < CapllF"ovi-. [lull z=)! gle, 
We give a formal statement of the result just established. 

Proposition 10.1. If F is C° and u € C” with r > 0, then 

(10.15) F(u) = Mr(u;z, D)ut+ R(u), 


where 


R(u) = F(vo(D)u) € C* 


and 
(10.16) Mp(u;2,£) = M(x,8) € S$}. 


Following [Bon] and [Mey], we call Mpr(u; x, D) a paradifferential operator. 
Applying Theorem 9.1, we have 


(10.17) ||! (a, D)fllnse < K||f\lz=», 
for p € (1,00), s > 0, with 

(10.18) K = Ky(F,u) = C|F"llon[1 + llullz~], 
provided 0 < s < N, and similarly 


(10.19) | M(a, D)f| 


co: S Alf 


Cs: 


Using f = u, we have the following important Moser-type estimates, extending 
Proposition 3.9: 


Proposition 10.2. If F is smooth with || F"||cn a) < 00, and0 < s <.N, then 


(10.20) |F(u)|las2 < Ky(F,u)|lullase + ||R(u)|l ase 
and 
(10.21) |F(w\lcs < Kn (F, u)llulles + ||R(u)|les, 


given 1 <p < oo, with Kn(F,u) as in (10.18). 


This expression for Ky(F’,u) involves the L°-norm of u, and one can use 
F"'||cw (7), where I contains the range of u. Note that if F(u) = u?, then 
CN (I) g 
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F’(u) = 2u, and higher powers of ||u||z-0 do not arise; hence we obtain the 
estimate 
(10.22) ||? | z75.0 < Cs|lullz< - ||ullzs., 5 > 0, 


and a similar estimate on ||u?||cs. 
It will be useful to have further estimates on the symbol M(2,€) = 


Mr(u;xz,&€) when u € C” with r > 0. The estimate (10.12) extends to 


|| D*[Wa(D) f + teegi(D) SF] ||. < Cellfller, (<r, 


(10.23) 
CEM flor, L>1, 


so we have, when u € C’, 


D8 D2M(z,£)| < Kaglé)7!, ran 
aoe |D2DEM(x,€)|< Kag(€) ae 
Kaglé)tt8-r, [gl > r, 
with 
(10.25) Kop = Kap(F,u) = CaallF'llciail + lull). 


Also, since Y;,(D) + t¢,+41(D) is uniformly bounded on C”, for t € [0, 1] and 
k > 0, we have 


(10.26) |IDEM(-,8)ller S$ Kar (Ey, 
where Kg, is as in (10.25), with |G] = [r] + 1. This last estimate shows that 
(10.27) ue Cv => Mp(u;2,£) € O'S). 
This is useful additional information; for example, (10.17) and (10.19) hold for 
5 > —r, and of course we can apply the symbol smoothing of § 9. 

It will be useful to have terminology expressing the structure of the symbols 


we produce. Given r > 0, we say 


(10.28) p(z,€) € AST, <=> ||DEp(-, E)llor < Café)” "1 and 
|D8 Dep(x,£)| < Cag(eyr-leollel-), [B| > vr. 

Thus (10.24)-(10.26) yield 

(10.29) M(a,é) € A’S?, 


for the M(x, €) of Proposition 10.1. If r € Rt \ Z*, the class A’S7" coincides 
with the symbol class denoted by A” by Meyer [Mey]. Clearly, A° Ls = Sts 
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and 
A'S" Cc C’ S49 Nn S15. 


Also, from the definition we see that 


(10.30) p(x, €) € A"S™, —> Dp(z, 6) € ST, for || <r, 
, StI") for |p| > 


It is also natural to consider a slightly smaller symbol class: 
(10.31) p(x, €) € ADTs > ||DEv(-,)llor+s S Cas(Ey™ tl", 8 20. 
Considering the cases s = 0 and s = || — r, we see that 
Ap Sis C AST. 
We also say 
(10.32) p(x,€) € "Si"; <=> the right side of (10.30) holds, 


sO 
PF cigee ene 


The following result refines (10.29). 


Proposition 10.3. For the symbol M(x,€) = Mpr(u;x,&) of Proposition 10.1, 
we have 


(10.33) M(z,€) € A§S? 4, 

provided u € C", r>0. 

Proof. For this, we need 

(10.34) llmallorss < C+ 2". 

Now, extending (10.9), we have 

(10.35) lIg(A)llorse < Cllgllow [1 + [lAll=](U[hllor+e +1), 


with N = [r + s] + 1, as a consequence of (10.21) when r + s is not an integer, 
and by (10.9) when it is. This gives, via (10.4), 


(10.36) lmallar+s < C(llullze) sup ||(Ye + thess)ullor+s, 
te 


where I = [0, 1]. However, 
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(10.37) (Ua + tdesi)ullorts < C+ 2" |lullor. 


For r+ s € Z", this follows from (9.41); for r +s ¢ Z*, it follows as in the 
proof of Lemma 9.8, since 


(10.38) 2-*SAS(, + typ41) is bounded in OPS} o. 


This establishes (10.34), and hence (10.33) is proved. 


Returning to symbol smoothing, if we use the method of § 9 to write 


(10.39) M(a,€) = M*(a,£) + M°(z, €), 
then (10.27) implies 
(10.40) M*¥(2,€)€ 5%, M(a,é) eC sty”. 


We now refine these results; for 1/* we have a general result: 


Proposition 10.4. For the symbol decomposition defined by the formulas (9.27)— 
(9.30), 


(10.41) p(x, £) € C’ST, => p* (a, €) € Ap ST. 


Proof. This is a simple modification of (9.42) which essentially says that 
p* (a, €) € A’S1",; we simply supplement (9.41) with 


(10.42) Jef | 


ieee |fI 


cr, § = 0, 
which is basically the same as (10.37). 
To treat M°(a, €), we have, for 6 < 7, 
(10.43) p(w, £) € ABST, => p’(a, £) € C'S NAZST, C STE, 


where containment in C’S)"5 5” follows from (9.35). To see the last inclusion, 


note that for p? (x, €) to belong to the intersection above implies 


ee <x C(O llr: ford<e<e 
C(éyn—lelt+s—-r)y, for s > r. 


In particular, these estimates imply p? (a, €) € 9 i "® This proves the following: 


Proposition 10.5. For the symbol M(x,€) = Mr(u;x,&) with decomposition 
(10.39), 
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(10.45) ue C” => M*(2,é) € Sy). 


Results discussed above extend easily to the case of a function F’ of several 
variables, say u = (w1,..., uz). Directly extending (10.2)-(10.6), we have 


L 
(10.46) F(u) = 50 M;(2, D)uj + F(Wo(D)u), 
j=l 
with 
(10.47) M,(2,€) = >> mh (a)bnsi(), 
k 
where 
1 
(10.48) ma) = | (0; F) (Ux (D)u + teeyi(D)u) dt. 


Clearly, the results established above apply to the MM; (x, €) here; for example, 
(10.49) u€ CT => M;(z,€) € Apsys. 

In the particular case F'(u, v) = wv, we obtain 
(10.50) uv = A(u;z,D)v + A(v; 2, D)u+ Vo(D)u- Vo(D)v, 


where 


[wa Du + 5ee+1(D)u] on ss(€). 


lags: 


(10.51) A(u; x, €) = 


> 
Il 


1 


Since this symbol belongs to S' tt for u € L®, we obtain the following extension 
of (10.22), which generalizes the Moser estimate (3.21): 


Corollary 10.6. For s > 0, 1 < p < o, we have 
(10.52) ||wel|z=» < C[llul|z~|lollas.e + |lull z= |lvllz~]. 


We now analyze a nonlinear differential operator in terms of a paradifferential 
operator. If F’ is smooth in its arguments, in analogy with (10.46)-(10.48) we have 


(10.53) F(x,D™u) = S> M,(2,D)D°u+ F(x, D™Wo(D)u), 


Ja|<m 


where F'(x, D™Wo(D)u) € C™ and 
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(10.54) = 2 me (x)be41(€), 
with 


1 
(10.55) mf (a) -| (OF /OGa) (x, Ux(D)D™u + tee y1(D)D™u) dt. 
0 


As in Propositions 10.1 and 10.3, we have, for r > 0, 
(10.56) ue C™r => M,(a,€) € ART, C S11 NCS). 
In other words, if we set 


(10.57) M(uz,D)= S~ M(x, D)D 


jal<m 
we obtain 
Proposition 10.7. [fu € C'™*", r > 0, then 
(10.58) F(a, D™u) = M(u;2, D)ut+ R, 
with R € C® and 
(10.59) M(u;2,€) € AgSty C STAC ST. 


As in Propositions 10.4 and 10.5, in this case symbol smoothing yields 


(10.60) M(u;«,€) = M*(a,€) + M°(a, 6), 
with 
(10.61) M#(a,é) € ASST, M(a,6) est". 


A specific choice for symbol smoothing which leads to paradifferential opera- 
tors of [Bon] and [Mey] is the following operation on M(z, €): 


(10.62) M* (x, €) =) U,_5M(z, £) ve(€), 
k 


where, as in (9.28), UV,_5 acts on M(z,€) as a function of 2. We use V,_5 = 
W,—5(D), with We(€) = D0 <2 Yi (€). We have 


(10.63) M(a,€) € L©S1% => M* (a, €) € B,ST%,, 
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with p = 1/16, where we define B,.S7", for p < 1 to be 
(10.64) BST = {0(a,€) € Si: b(n, €) supported in |n| < plé|}, 


and where b(,€) = | b(e,é)e"* de. Set BST —Uy21B ST. 

Most of the applications of the material of this section made in the following 
chapters of this book will involve symbol smoothing, (10.60)—(10.61), with 6 < 1. 
However, we will establish some basic results on operator calculus for symbols of 
the form (10.64). 

We will analyze products a(z,D)b(z,D) = p(a,D) when we are given 
a(a,€) € Si',(IR") and b(x,€) € BSi"(IR"). We are particularly interested in 
estimating the remainder r,,(«, €), arising in 


(10.65) a(w, D)b(x, D) = py(w, D) + r(x, D), 
where 
j—le| 
(10.66) p(a,é) = D> —-ABa(e, €)- 02b(c, €). 
jal<v 


Proposition 10.8 below is a variant of results of [Bon] and [Mey], established 
in [AT]. 
To begin the analysis, we have the formula 


(10.67) r,(a,€) = _ / fate.é+n)— > agate, e)]e b(n.) dn 


lal<v 
Write 
(10.68) ne = > ree), 
j20 
with 
yj (a, €) = [Assn Bieén) dn 
(10.69) = f Ase. Bile. dy, 


where the terms in these integrands are defined as follows. Pick 0 > 1, and take a 
Littlewood—Paley partition of unity {y5 : J > O}, such that yo(7) is supported in 
|n| < 1, while for j > 1, yj (7) is supported in 3~! < |n| < JI. Then we set 
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Ays(e6.m) = Goa [alee +) — Ly TaRate.6)] e500) 


(10.70) Ja|<v 


Bj(a,€,n) = O(n, )e;(ne*". 


Note that 

(10.71) B;(x,§,y) = pj(Dy)o(@ + y, §)- 
Thus 

(10.72) |By(@,€, Iz < CO, ler. 
Also, 


(10.73) supp (7, €) C {|nl < plé|} => By(z,€,y) =0, for 7! > ple. 


We next estimate the L'-norm of A,;(x,£,-). Now, by a standard proof of 
Sobolev’s imbedding theorem, given K > n/2, we have 


(10.74) Av; (2,6 ln < CIT; ALs(2,€, Jl, 


where [; f (7) = f (7), so Ay is supported in |7| < @. Let us use the integral 
formula for the remainder term in the power-series expansion to write 


(10.75) | 
Ap (a,b) = 
. j 1 
pi (Wn) S- v+1 (| @ = s)”*02a(a,€ + sn) as) lane, 
0 


n ! 
(27) rare a! 


Since |7| < ¥ on the support of [4 if also W—! < pl€|, then |07| < pd? |E]. 
Now, given p € (0,1), choose 0 > 1 such that pJ* < 1. This implies (€) ~ 
(€ + sn), for all s € [0, 1]. We deduce that the hypothesis 


(10.76) \O¢a(a,€)| < Ca(€)"?2!*!, for jal >v +1, 
implies 
(10.77) | Avj (2, €,)Ilz2 < CLhwUtY (eyH2-¥ 1 for WI-1 < plél. 


Now, when (10.72) and (10.77) hold, we have 


(10.78) Irvj(a, | < C,0et-") (ey—” 1, Oller, 
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and if (10.73) also applies, we have 
(10.79) |r, (@, | S CAO Oller if v+1>r, 


since 
So wit < olgpt 
0I-1 <plE| 


in such a case. 
To estimate derivatives of r,(x,€), we can write 


DE De ryj(#,€) = 
(10.80) 
» ay (F)(2) [oe pe avile sa) ‘ D® DY B,(2,€, agi 


Bi +82=8 y1+72=7 


Now D# DP A,j;(z,&,y) is produced just like A,,(x,&,y), with the symbol 
a(x,€) replaced by DED) alae); and DP DY B; (x,&,—y) is produced just 
like Bj (x, €, —y), with b(x, €) replaced by D?? DP b(x, €). Thus, if we strengthen 
the hypothesis (10.76) to 


(10.81) |O8d¢a(z, €)| < Cag (Eye? le4!4l, for jal >v +1, 
we have 
G08 |[DPD 4@e imager ee are, 


for #/—1 < p|€|. Furthermore, extending (10.72), we have 


(10.83) || D3? D?? B; (a, €,-)\ln~ < Crp l2l-7)5 || DPW, E)|ler- 
Now 
(10.84) S- gi@ti+]Bal-r) < Cle|ett+lBal—+ 

07 <plé| 


ifv +1 >, so as long as (10.73) applies, (10.82) and (10.83) yield 


(10.85)  |D8D?r,(2,6)|<C YS) (eye tel-lnl-r DP, Eller 
yt+y2=Y7 


if +1 > r. These estimates lead to the following result: 


Proposition 10.8. Assume 


(10.86) a(z,£)€ St, b(z,€) € BST. 
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Then 
(10.87) a(x, D)b(x, D) = p(x, D) € OPSYj™. 
Assume furthermore that 
(10.88) |AZAZa(x, 6)| < Copley te"! for al >v +1, 
with [4g < p, and that 
(10.89) | DE(-, Olce $ Caleymell, 
Then, ify + 1 > 1, we have (10.65)-(10.66), with 
(10.90) hie D) COP 

The following is a commonly encountered special case of Proposition 10.8. 
Corollary 10.9. In Proposition 10.8, replace the hypothesis (10.89) by 
(10.91) D8b(x,€) € ST?, for |B| = K, 
where K € {1,2,3,...} is given. Then we have (10.65)-(10.66), with 
(10.92) mighe0rs* fuSk. 
Proof. The hypothesis (10.91) implies (10.89), with r = K. 


We can also deduce from Proposition 10.8 that a(a, D)b(a, D) has a complete 
asymptotic expansion if b(a,€) is a symbol of type (1,6) with 6 < 1. 


Corollary 10.10. If0 <6 < 1and 

(10.93) a(x,€) € Sy, b(a,£) € S™, 

then a(a, D)b(a, D) € Orsi.” and we have (10.65)-(10.66), with 
(10.94) r,(2,D) e OPS), 


Proof. Altering b(, €) by an element of S; 5°, one can arrange that the condition 


(10.73) on supp (7, €) hold. Then, apply Corollary 10.9, with mz = m+ K6, so 
m2 — K =m-— K(1-— 4), and take K =v. 


Note that, under the hypotheses of Corollary 10.10, 
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1 m—-V\i— 
(10.95) S- a OE ala, €)  APH(a, €) € Caco 


ja|=v 
so we actually have 
(10.96) ry-1(a, D) € OPSET™ 70-9). 


The family U;,OPBS7", does not form an algebra, but the following result is 
a useful substitute: 


Proposition 10.11. If p;(x,£) € Bp, Sit and p = pi + p2 + prp2 <1, then 


P1 (zs, €)po(a, g) = Bs, 


(10.97) a 
pi(x, D)pe(x,D) € OPB,ST it". 


Proof. The result for the symbol product is obvious; in fact, one can replace 
p by pi + po. As for A(z,D) = pi(ax,D)p2(a,D), we already have from 
Proposition 10.8 that A(z,€) € S$ lilies we merely need to check the support of 


A(n, €). We can do this using the formula 


(10.98) 4,8 = i Ailn —G,£ + Opeale,é) dt. 


Note that given (7, €), if there exists ¢ € IR” such that p1(7 — ¢,€ + ¢) #4 0 and 
Pa(¢,&) #0, then 


l7—€|<pil€+¢l|, [el < pal€l, 
SO 
In| < prlE + ¢] +1¢] < prl€| + eile] + pal€l < (p1 + p2 + pipe) ||. 


This completes the proof. 


Nonlinear differential operators with rough coefficients 

Here we extend some of the paradifferential analysis produced above to op- 
erators F'(2, D™u) where F is not C™ in all its arguments (particularly x). An 
example to keep in mind (which will arise in §17 of Chapter 14) is 


(10.99) F(z, Du) = Du(z)'h(x) Du(x), he C%(Q), 


where u : Q > R” (Q open in R”) and h : Q > M(n,R). As in (10.53)-(10.55), 
we use the formulas 
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(10.100) F(a, D™u) = M,(u;x,D)D°u+ F(a, DVo(D)u), 
Ja|<m 

where 

(10.101) gag d meta) esr (€), 

with 


1 
(10.102) mete) = f (OF /OCq) (x, U4(D)D™u + thes1(D)D™u) dt. 


Proposition 10.12. Assume 0 < r < s, u € C™*", Also assume F = F(x, (€) 
satisfies 


(10.103) D?F €C*%, ors>landD(F EC. 
Then, for each a, {m& : k € Z*} is bounded in C, and O(2*—") in C8, hence 
(10.104) Ma(u; x, &) € C'S? 5 mces? gy 0H 1-7/2. 
Proof. This follows from the observation that 
{,(D)D™u + thy1(D)D™u:k € Z*, t € [0,1)} 
is bounded in C” and O(2*(8—")) = O(2*°*) in CS. 


We rewrite (10.100) as 


(10.105) F(a,D™u) = Mer(u;z,D)u+R, REC, 
with 
(10.106) Mpr(u; 2, €) € C™ST9 NCLST"s 


when the hypotheses of Proposition 10.12 hold. Now we can pick y € (6,1) and 
apply symbol smoothing, to write 


(10.107) Mp(u; 2, €) = M# (a, €) + M*(a, €). 


By (9.37), 
(10.108) 
Mp(u;2,£) € CST; > M#(a,£) € ST, M*(2,€) € C28 OO, 
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Results on the action of M°(x, D) on various function spaces can then be obtained 
from Proposition 9.10, supplemented by (9.43). We have 


M?(a,D): Hotm—s0-8)P _y FFow, 


(10.109) crtms(48) 


—> CE, 
provided p € (1,00) and 


—(l-—y)s<oa<-s, and also provided 
(10.110) : 
a= 8s, for action on Zygmund spaces. 


Exercises 


1. Prove the commutator property: 
(10.111) [OPS*, OPAGS1s] C OPSTF"", O<r<1,0<d6<1. 
2. Prove that, for0 < 6 < 1, 


(10.112) P € OPAGS'; => P* € OPA} S's. 


(Hint: Use P(x, D)* = P*(x, D), with P* (a, €) ~ )) DE De p(a, €). Show that 
p(2,€) € ApSi's => D3 DEp(e,£) € AdSys OP!) 
3. Show that 


(10.113) S> aa(a,D"~*u)D°u = M(u;x, D)u + R, 


|o|<m 
where R € C™ and, for0 < r <1, 
(10.114) uec™ t" —» M(u;a,é) € ApST1 + ST". 


Deduce that you can write 


(10.115) M(u;2x,€) = M* (a, £) + M(x, 8), 
with 
(10.116) M* (z,€) € ASST, M(a,e)€ ST”. 


Note that the hypothesis on wu is weaker than in Proposition 10.7. 
4. The estimate (10.9) follows from the formula 


D*g(h) = ba Clai,..., av) hor... Ae) g (ph), 


ayt:-+ay,=a 
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which is a consequence of the chain rule. Show that the following Moser-type estimate 
holds: 


(10.117) ID6g(A)llz-~ SC SS ll‘ lcr-allrllzs' || D'Allz~. 


1<v<e 


5. The paraproduct of J.-M. Bony [Bon] is defined by applying symbol smoothing to the 
multiplication operator, Myu = fu. One takes 


(10.118) Tyu = S > Wx-s(D)f - dx(D)u, 
k 


where, as in (10.62), Ve(€) = ><, Ys (€). Show that, with Ty = F(x, D), 
(10.119) f € L™(R") => F(z, €é) € $2.1 (R”). 


Show that, for any r € R, 


(10.120) 
f © CU(R”) => |D8D? F(z, | < Caallfller(€)7 0", for Ja] > 1. 


6. Using Propositions 10.8-10.11, show that if p(x, €) € Bi/2S7%, then 
(10.121) f ©C? = [T;, p(a2, D)] € OPBST%. 


Applications of this are given in [AT]. 
7. Show that p(x, €) € BS7, implies p(x, D)* € OPS7"1, and, if p is sufficiently small, 


(10.122) p(x, €) € Bp Si => p(x, D)* € OPBST}. 
8. Investigate properties of operators with symbols in 


(10.123) BST, = BST, N AjST. 


11. Young measures and fuzzy functions 


Limits in the weak* topology of sequences f; € L?(Q) are often not well behaved 
under the pointwise application of nonlinear functions. For example, 


(11.1) sin nz 0 weak” in L*((0, z]), 
while 

: 2 1 * foe) 
(11.2) sin’ nt > 5 weak* in L™((0, 7]) 


(see Fig. 11.1). A fuzzy function is endowed with an extra piece of structure, 
allowing for convergence under nonlinear mappings. 
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Assume 2 is an open set in R”. Given 1 < p < ov, we define an element of 
Y?(Q) to be a pair (f, \), where f € L?(Q) and J is a positive Borel measure on 
Q x R (R = [—co, ov]), having the properties 


(11.3) y € L?(Qx R,dX(z,y)), 


(so, in particular, 2. x {+00} has measure zero), 
(11.4) ME x R) = £L"(B), 


for Borel sets E C Q, where £” is Lebesgue measure on 2), and 


(11.5) J [vane = [ioe 


EXR 


for each Borel set FE C (. We can equivalently state (11.4) and (11.5) as 


(11.6) [few \ dmc) = f 2) és 


and 


(11.7) ff e@varey) = f eee) de, 


for y € Co(Q), that is, for continuous and compactly supported vy. 
Note that (11.5) implies 


(11.8) [ise sax < ff yl ax(e.y), 


EXxR 


A 
HAAN 


FIGURE 11.1 Approaching a Fuzzy Limit 
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since we can write EF = E, U E2 with f > 0 on E; and f < 0 on £y. If we 
partition F into tiny sets, on each of which f is nearly constant, we obtain 


(11.9) / LF(a)|? de < / lw? dd(c, y). 
E ExR 


We say that (f, A) is a fuzzy function, and \ is a Young measure, representing f. 
A special case of such 4 is yf, defined by 


(11.10) i | (x,y) dys(z,y) = / (x, f(x) de, 


for w € Co(Q x R). We say (f, yr) is sharply defined. 
Fuzzy functions arise as limits of sharply defined functions in the following 
sense. Suppose f; € L?(Q), 1 < p < oo, and (f, A) € Y?(Q). We say 


(11.11) fy > (f,A) in Y?(Q), 
provided 

(11.12) f; > f weak" in L?(Q) 
and 

(11.13) Vf; 2A weak” in M(Q x R), 


and furthermore, 
(11.14) IIyll ie» axBary,) <C<o. 
Actually, (11.12) is a consequence of (11.13) and (11.14), thanks to (11.9). 


To take an example, if 2 = (0,7) and f,(a) = sin na, as in (11.1), it is easily 
seen that 


(11.15) fn (0,A0) in Y*(Q), 
where 

2dxd 
(11.16) ddo(#,y) = xt-1,)(¥) == 


Via 


Also, 


(11.17) fio (5.1) in, ¥(Q), 


78 13. Function Space and Operator Theory for Nonlinear Analysis 


where 


2 dx dy 
y(y — 1) 
The following result illustrates the use of Y?(Q) in controlling the behavior of 


nonlinear maps. We make rather restrictive hypotheses for this first result, to keep 
the argument short and reveal its basic simplicity. 


(11.18) dx (2, Y) = Xjo0,1(y) 


Proposition 11.1. Let & : R — R be continuous. If f; + (f, A) in Y°°(Q), then 
(11.19) ®(f;) +g weak" in L*(Q), 


where g € L®(Q) is specified by 


(11.20) [ae x) dz = [few )dX(x,y), ~ € Co(Q). 


Proof. We need to check the behavior of [ ®(f;)y dx. Since ®(f;) is bounded 
in L©(Q), it suffices to take y in Co(Q), which is dense in L1(Q). Let I be a 
compact interval in (—oo, oo), containing the range of each function ®(f;). Now, 
for any y € Co(Q), 


ye (ede = ff o(e)6W) dry (av) 


Qxt 


5, y) dX(x,y), 


Ox 


(11.21) 


since yp, + A weak” in M(Q x I). This proves the proposition. 


Under the hypotheses of Proposition 11.1, we see that, more precisely than 
(11.19), 


(11.22) ®(f;) > (g,v) in YO(Q), 


where g is given by (11.20) and v is specified by 
123) ff oe.) atau) = [f oe, 2) aey), be ColOxB). 


Thus is the natural image of \ under the map ®(,y) = (x, ®(y)) of Qx I> 


Q x R. One often writes v = ®,... The extra information carried by (11.22) is 
that ya(f;) + v, weak” in M(Q x R), which follows from 
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[f{ ew dyo(p,)(@y) = [[e@ew) dys, (2,y) 
(11.24) ay [[e@ew) dX(x,y). 
We can extend Proposition 11.1 and its refinement (11.22) to 
(11.25) fj > (f,A) in Y?(Q) => ®(F;) > (g,v) in YA(Q), 


with 1 < p,q < ov, where g and v are given by the same formulas as above, 
provided that 6 : R — R is continuous and satisfies 


(11.26) |®(y)| < Clyl?/*. 


We need this only for large |y| if has finite measure. 
This result suggests defining the action of ® on a fuzzy function (f, A) by 


(11.27) ®(f,) = (9,), 


where g and v are given by the formulas (11.20) and (11.23). Thus (11.22) can be 
restated as 


(11.28) fj > (frd) in Y°(Q) => B(f;) > B(f, dA) in YQ). 
It is now natural to extend the notion of convergence f; > (f, A) in Y?(Q) to 


(f5,49) + (Cf, A) in Y?(Q), provided all these objects belong to Y?(Q) and we 
have, parallel to (11.12)-(11.14), 


(11.29) f; > f weak" in L?(Q), 
(11.30) Aj >A weak* in M(Q x R), 
and 

(11.31) lvllzecaxkar,) SC <0. 


As before, (11.29) is actually a consequence of (11.30) and (11.31). Now (11.28) 
is easily extended to 


(11.32) (f;,A;) > (fA) in ¥°°(Q) => B(f;,A;) 3 B(f, A) in YQ), 


for continuous ® : R — R. There is a similar extension of (11.25), granted the 
bound (11.26) on ®(y). 

We say that f; (or more generally (f;,;)) converges sharply in Y?(Q), if 
it converges, in the sense defined above, to (f,) with A = yr. It is of interest 
to specify conditions under which we can guarantee sharp convergence. We will 
establish some results in that direction a bit later. 
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When one has a fuzzy function (f,), it can be conceptually useful to pass 
from the measure \ on 2 x R to a family of probability measures \,, on R, defined 
for a.e. x € 2. We discuss how this can be done. From (11.4) we have 


(11.33) // w(y) aX(x, y)| < sup |¥| £L"(), 
ExR 
and hence 
a3 | ff o(e)e(u) are,y)| < sup | -lellaney 
QxR 


It follows that there is a linear transformation 
(11.35) T:C(R) > L*(Q),  ||Td||z~(ay < sup |, 
such that 
(11.36) [fe@vw) ae = f oe)P(o) ae 
QxR Q 
Using the separability of C(IR), we can deduce that there is a set S C Q, of 
Lebesgue measure zero, such that, for all  € C(R), T7(z) is defined pointwise, 


for x € Q \ S. Note that T is positivity preserving and T(1) = 1. Thus for each 
x € Q\ S, there is a probability measure \,, on R such that 


(11.37) THe) = / #(y) de(0) 

Hence 

ais) ff stow) aXe») - [( [eee ) duty) 
QxR Q SR 


From this it follows that 
(11.39) {een olen = [(fvenan »)) 
QxR 


for any Borel-measurable function w that is either positive or integrable with re- 
spect to d\. Thus we can reformulate Proposition 11.1: 
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Corollary 11.2. [f ® : R — Ris continuous and f; — (f,) in Y*(Q), then 


(11.40) ®(f;) +g weak* in L*(Q), 

where 

(11.41) g(x) = few drx(y), ae cE. 
R 


One key feature of the notion of convergence of a sequence of fuzzy func- 
tions is that, while it is preserved under nonlinear maps, we also retain the sort of 
compactness property that weak* convergence has. 


Proposition 11.3. Let (f;,A;) € Y°(Q), and assume || f;\|L-(a) < M. Then 
there exist (f, A) € Y°°(Q) and a subsequence (f;,,, X;,,) such that 


(11.42) (fv Ay.) —> (f,A)- 


Proof. The well-known weak* compactness (and metrizability) of {g € L°(Q) : 
llg|lzc° < M} implies that one can pass to a subsequence (which we continue to 
denote by (f;,,)) such that f; > f weak” in D°(Q). 

Each measure \,; is supported on 2. x I, I = [—M, M]. Now we exploit the 
weak* compactness and metrizability of {u € M(K x I) : ||u|| < £°(U&)}, 
for each compact kK C Q, together with a standard diagonal argument, to obtain 
a further subsequence such that \;, — A weak* in M(Q x I). The identities 
(11.6) and (11.7) are preserved under passage to such a limit, so the proposition 
is proved. 


So far we have dealt with real-valued fuzzy functions, but we can as easily con- 
sider fuzzy functions with values in a finite-dimensional, normed vector space V . 
We define Y?(Q, V) to consist of pairs (f, \), where f € L?(Q,V) is a V-valued 
L? function and 2 is a positive Borel measure on Q x V (V = V plus the sphere 
Soo at infinity), having the properties 


(11.43) ly| € L°?(Q x V,dX(2,y)), 
so in particular 2 x S,, has measure zero, 
(11.44) MExV)=L"(E), 


for Borel sets # C Q, and 


(11.45) ffyren- [ioaer 
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for each Borel set EF C2. 

All of the preceding results of this section extend painlessly to this case. Instead 
of considering ® : R — R, we take ® : V; — Vo, where V; are two normed finite- 
dimensional vector spaces. This time, a Young measure “disintegrates” into a 
family ,, of probability measures on V. 

There is a natural map 


(11.46) & : Y%(Q,Vy) x Y%(Q, V2) —> ¥°(2, Vi @ Va) 
defined by 
(11.47) (fi, A1)&( fa, A2) = (fa ® fe, v), 


where, for a.e. x € Q, Borel Fy C V;, 
(11.48) Vy(F x Fy) => Ate (F1)A20 (Fo). 


Using this, we can define an “addition” on elements of Y°(Q, V): 


(11.49) (f1,A1) + (fa, A2) = S((fi, Ar) &(f2, A2)), 


where 5: V @V — V is given by S(v, w) = vu + w, and we extend S to a map 
S:Y°(Q,V eV) > Y*(Q, V) by the same process as used in (11.27). 

Of course, multiplication by a scalar a € R, M, : V — V, induces 
a map M, on Y°(Q,V), so we have what one might call a “fuzzy linear 
structure” on Y°(Q, V). It is not truly a linear structure since certain basic re- 
quirements on vector space operations do not hold here. For example (in the case 
V = R), (f,A) € Y*(Q) has a natural “negative.” namely (—f,), where 
\(E) = (—E). However, (f, 4) + (—f,A) 4 (0,70) unless (f, A) is sharply 
defined. Similarly, (f, A) + (f,A) # 2(f, A) unless (f, A) is sharply defined, so 
the distributive law fails. 

We now derive some conditions under which, for a given sequence u; — (u, A) 
in Y°°(Q)) and a given nonlinear function F’, we also have F(u;) + F(u) weak* 
in L°(Q), which is the same here as F(u) = F’. The following result is of 
the nature that weak* convergence of the dot product of the R?-valued functions 
(uj, F(u;)) with a certain family of R?-valued functions V(u;) to (u, F)-V will 
imply F = F(u). The specific choice of V (u;) will perhaps look curious; we will 
explain below how this choice arises. 


Proposition 11.4. Suppose uj; — (u,X) in Y°(Q), and let F : R > R be C'. 
Suppose you know that 


(11.50) ujq(u;) — F(u;)n(uj;) —> ug— F7 weak* in L®(Q), 
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for every convex function 7 : R > R, with q given by 


(1151) aly) = fo (s) FCs) ds, 

and where 

(11.52) q(u,rA)= (G1), F(u,r) = (F v2), n(u,A) = @, v3). 

Then 

(11.53) F(u;) > F(u) weak" in L*(Q). 

Proof. It suffices to prove that F = F(u) a.e. on Q. Now, applying Corollary 


11.2 to B(y) = yq(y) — F(y)n(y), we have the left side of (11.50) converging 
weak™ in L°°(Q) to 


v(x) = [law — F(y)n(y)] dAc(y), 


so the hypothesis (11.50) implies 
v=ug—F7, ae.onQ. 


Rewrite this as 
(11.54) i (F(y) — F(x))n(y) - (u(z) — v)a(u)} dvx(y) =0, aewen. 


Now we make the following special choices of functions 7 and q: 

(11.55) na(y) =|ly-al, da(y) = sgn(y — a) (F(y) — F(a). 

We use these in (11.54), with a = u(x), obtaining, after some cancellation, 
(11.56) (F(u(x)) — F(z) i) ly —u(x)| dv\c(y) =0, aexredQ. 
Thus, for a.e. x € Q, either F(x) = F(u(x)) or Ar = 5y(z), Which also implies 


F(x) = F(u(z)). The proof is complete. 


Why is one motivated to work with such functions 7(u) and q(u)? They arise in 
the study of solutions to some nonlinear PDE on 2 C R?. Let us use coordinates 
(t,z) on Q. As long as wu is a Lipschitz-continuous, real-valued function on Q, it 
follows from the chain rule that 


(11.57) up + F(u)e =0 => nu) + d(u)e = 0, 
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provided q'(y) = »’'(y)F’(y), that is, q is given by (11.52). (For general u € 
L°°(Q), the implication (11.57) does not hold.) Our next goal is to establish the 


following: 


Proposition 11.5. Assume u; € L°°(Q), ofnorm < M < oo. Assume also that 


(11.58) Oyuj + OrF (uj) 40 in Hy (Q) 
and 
(11.59) On(uj) + Oxq(uj) precompact in H,,2(Q), 


for each convex function n : R — R, with q given by (11.51). If uj — u weak* in 
L(Q), then 


(11.60) Ou + 0, F'(u) = 0. 
Proof. By Proposition 11.3, passing to a subsequence, we have u; — (u, A) in 


Y°(Q). Then, by Proposition 11.1, F(u;) > F’, q(u;) > 9g, and n(u;) > 7 
weak* in L°(Q). Consider the vector-valued functions 


(11.61) vj = (uj, F(uj)), wy = (a(uj), —n(uy)). 


Thus v; — (u,F), wj; > (¢,—-7) weak* in L°(Q). The hypotheses (11.58)— 
(11.59) are equivalent to 


(11.62) div v;, rot w,; precompact in Het: 


Also, the hypothesis on ||u,;||,-. implies that v; and w, are bounded in L°(Q), 
and a fortiori in L?,,(Q). The div-curl lemma hence implies that 


(11.63) vj-wj>v-win DQ), v=(u,F), w= (g-7). 
In view of the L°°-bounds, we hence have 

(11.64) ujq(u;) — F(uj)n(u;) —> ug—F7 weak* in L™(Q). 
Since this is the hypothesis (11.50) of Proposition 11.4, we deduce that 
(11.65) F(u;) —> F(u) weak” in L°(Q). 


Hence 0,u; + 0,F (u;) + Ou + 0,F(u) in D’(Q), so we have (11.60). 


One of the most important cases leading to the situation dealt with in Proposi- 
tion 11.5 is the following; for ¢ € (0, 1], consider the PDE 
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(11.66) OpUe + Op F (ue) = €02ue on N= (0,00) xR, u-(0) = f. 


Say f € L°°(R). The unique solvability of (11.66), for t € [0, co), for each e > 0, 
will be established in Chap. 15, and results there imply 


(11.67) Ue € C%(Q), 

(11.68) Iluellt~©(ay < Wf llz~, 

and 

(11.69) ef f- (Ortte)? dx dt < slits. 


The last result implies that \/€0,,u- is bounded in L?(Q). Hence ¢0?u- > 0 
in H~‘(Q), ase — O. Thus, if Ue, is relabeled uj, with ce; — 0, we have 
hypothesis (11.58) of Proposition 11.5. We next check hypothesis (11.59). 

Using the chain rule and (11.66), we have 


(11.70) O:n(ue) + Orq(ue) = €02n(ue) — en! (Uc) (One), 


at least when 7 is C? and q satisfies (11.52). Parallel to (11.69), we have 


(11.71) ef filualien dx dt = [ult@) ax ~ f n(ue(T,2)) dex. 


A simple approximation argument, taking smooth 7; — 7, shows that whenever 
7 is nonnegative and convex, C? or not, 


(11.72) Opn(uc) + Orq(ue) = €02n(ue) — Re, 
with 
(11.73) R- bounded in M(Q). 


Since 0,7(ue) = 7'(uz)OzUe, and any convex 77 is locally Lipschitz, we deduce 
from (11.68) and (11.69) that \/€0,,.7(u-) is bounded in L?(Q). Hence 


(11.74) e02n(ue) +0 in H-1(Q), ase 0. 
We thus have certain bounds on the right side of (11.72), by (11.73) and (11.74). 


Meanwhile, the left side of (11.72) is certainly bounded in H,"?(Q), Vp < cv. 
This situation is treated by the following lemma of F. Murat. 
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Lemma 11.6. Suppose F is bounded in HP (Oy, for some p > 2, and F Cc 
G +H, where G is precompact in 1: (Q) and His bounded in M jge(Q). Then 
F is precompact in H;,'(Q). 


Proof. Multiplying by a cut-off y € C§°(Q), we reduce to the case where all 
f © F are supported in some compact Kk, and the decomposition f = g + h, 
g € G, h € Halso has g,h supported in K. Putting K in a box and identifying 
opposite sides, we are reduced to establishing an analogue of the lemma when (2 
is replaced by T”. 

Now Sobolev imbedding theorems imply 


M(T") CH™*"(T"), se€(0,n), ge (1 7 


Via Rellich’s compactness result (6.9), it follows that 


(11.75) u: M(T") 4 H-14(T"), compact Vq € (1, Z -): 
n— 


Hence H is precompact in H~'4(T”), for any q < n/(n — 1), so we have 
(11.76) FF precompact in H~'4(T”"), bounded in H~'?(T”), p> 2. 


By a simple interpolation argument, (11.76) implies that F is precompact in 
H~1'(T"), so the lemma is proved. 


We deduce that if the family {u. : 0 < « < 1} of solutions to (11.66) satisfies 
(11.67)-(11.69), then 


(11.77) Opn(ue) + Orq(ue) precompact in H,,.'(), 
which is hypothesis (11.59) of Proposition 11.5. Therefore, we have the following: 


Proposition 11.7. Given solutions uz, 0 < € < 1 to (11.66), satisfying (11.67)- 
(11.69), a weak* limit u in L°(Q), as € = €; — 0, satisfies 


(11.78) Ou + 0,F(u) = 0. 


The approach to the solvability of (11.78) used above is given in [Tar]. 
In Chap. 16, §6, we will obtain global existence results containing that of 
Proposition 11.7, using different methods, involving uniform estimates of 
||O.Ue(t)||z1(g)- On the other hand, in §9 of Chap. 16 we will make use of 
techniques involving fuzzy functions and the div-curl lemma to establish some 
global solvability results for certain 2 x 2 hyperbolic systems of conservation 
laws, following work of R. DiPerna [DiP]. 

The notion of fuzzy function suggests the following notion of a “fuzzy solu- 
tion” to a PDE, of the form 
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a) 
(11.79) S> a Fi(u) =0. 
J 
Namely, (u, A) € Y°°(Q) is a fuzzy solution to (11.79) if 
O : ; = 
(11.80) S- apf i= 9 in DQ), File) = | Fi(y) doy), 
j 
j 


This notion was introduced in [DiP], where (u, A) is called a “measure-valued 
solution” to (11.79). Given |Fj(y)| < C(y)?, we can also consider the concept 
of a fuzzy solution (wu, A) € Y?(Q). Contrast the following simple result with 
Proposition 11.5: 


Proposition 11.8. Assume (u;,A;) € Y°(Q), |lusllne < M, and (u;,A;) > 
(u, A) in Y%°(Q). If 


(11.81) S > xFe(uj) +0 in D'(Q), 
k 


as j — ©, then wu is a fuzzy solution to (11.79). 


Proof. By Proposition 11.1, Fy(u;) 4 Fx weak* in L°(Q). The result follows 
immediately from this. 


In [DiP] there are some results on when one can say that, when (u,A) € 
Y™°(Q) is a fuzzy solution to (11.79), then u € L°(Q) is a weak solution to 
(11.79), results that in particular lead to another proof of Proposition 11.7. 


Exercises 


1. If f; > (f,A) in Y°°(Q), we say the convergence is sharp provided \ = v7. Show 
that sharp convergence implies 


fj > f in L?(Qo), 


for any Q) CCQ. 
(Hint: Sharp convergence implies | f;|” — ||? weak* in L°°(Q). Thus f; — f weakly 
in L? and also Fi llz2@a0) > WF llz2(a0)) 

2. Deduce that, given f; > (f, A) in Y°(Q), the convergence is sharp if and only if, for 
some subsequence, f;, — f a.e.onQ. 

3. Given (f, 4) € Y°°(Q) and the associated family of probability measures Az, x € 2, 
as in (11.37)-(11.39), show that AX = yy if and only if, for a.e. x € Q, Az is a point 
mass. 

4. Complete the interpolation argument cited in the proof of Lemma 11.6. Show that (with 
X=A7l(F))ifq<2<p, 


X precompact in L4(T”), bounded in L?(T”) —> X precompact in L?(T”). 
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(Hint: If fn € X, fn — f in L4(T”), use 


Ilfn — fllz2 < fn — fllfallfn — fllze*-) 


5. Extend various propositions of this section from Y*°(Q) to Y?(Q), 1 < p< oo. 


12. Hardy spaces 

The Hardy space $)'(IR") is a subspace of L1(IR”) defined as follows. Set 
(12.1) (Gf)(x) = sup{|y: * f(z)|: 9 EF, t > O}, 

where vy: (2) = t-"yp(x/t) and 

(122) F = {pe C@(R") oz) = 0 for |x| > 1, [|Vellz~ <1}. 
This is called the grand maximal function of f. Then we define 

(12.3) H'(R") = {f € L'(R"): Gf € L'(R")}. 


A related (but slightly larger) space is h'(IR”), defined as follows. Set 


(12.4) (G° f)(a) = sup{lygr * f(@)|: 9 € F, 0<t <I}, 
and define 
(12.5) 6 (R”) = {f € LI(R”) : Gf € L(R")}. 


An important tool in the study of Hardy spaces is another maximal function, 
the Hardy-Littlewood maximal function, defined by 


(12.6) MiNle)=sup Segy fl dy 


B,(a) 


The basic estimate on this maximal function is the following weak type-(1,1) 
estimate: 


Proposition 12.1. There is a constant C = C(n) such that, for any \ > 0, f € 
L1(R"), we have the estimate 


(12.7) meas({x € R" : M(f)(a) > A}) < C 


Il fla. 


Note that the estimate 
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meas({x ER”: |f(x)| > X}) < <|lfllz 


1 

A 

follows by integrating the inequality |f| > Axs,, where Sy = {|f| > A}. 
To begin the proof of Proposition 12.1, let 


(12.8) Fy ={a@ € R”: Mf(xz) > A}. 


We remark that, for any f € L1(IR") and any \ > 0, F) is open. Given x € F), 
pick r = r, such that A,|f|(x) > A, and let B, = B,.,,(x). Thus {B, : x € Fy} 
is a covering of F', by balls. We will be able to obtain the estimate (12.7) from the 
following “covering lemma,” due to N. Wiener. 


Lemma 12.2. IfC = {B, : a € YA} is a collection of open balls in R”, with 
union U, and if mp < meas(U), then there is a finite collection of disjoint balls 
B; €C, 1< 39 <K, such that 


(12.9) S| meas(B;) > 3-"mo. 


We show how the lemma allows us to prove (12.7). In this case, letC = {B, : 


x € Fy}. Thus, if mo < meas(F)), there exist disjoint balls B; = B,.,(a;) such 
that meas(UB,;) > 37”"mo. This implies 


3” 3” 
(12.10) mo < 3” 5” meas(B;) < => [ro dx < | Fo) dx, 
B; 


for all mp < meas(F), which yields (12.7), with C = 3”. 

We now turn to the proof of Lemma 12.2. We can pick a compact K Cc U 
such that m(A) > mo. Then the covering C yields a finite covering of K, 
say Aj,..., An. Let B, be the ball A; of the largest radius. Throw out all A, 
that meet B;, and let Bz be the remaining ball of largest radius. Continue until 
{Aj,..., An} is exhausted. One gets disjoint balls B,,..., Bx inC. Now each 
Aj meets some By, having the property that the radius of By is > the radius of A;. 
Thus, if B; is the ball concentric with B;, with three times the radius, we have 


kK N 
LJ B > UW Ac> K. 
j=l 


l=1 


This yields (12.9). 
Note that clearly 


(12.11) fe L™(R") = |M(f)llz~ < Ifllz~- 
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Now the method of proof of the Marcinkiewicz interpolation theorem, Proposition 
5.4, yields the following. 


Corollary 12.3. If 1 < p < ©, then 
(12.12) IM(f)llze < Cpl fllze- 


Our first result on Hardy spaces is the following, relating h!(IR”) to the smaller 
space §1(R"). 


Froposiion 12.4. [fu € 6'(R”) has compact support and { u(x) dx = 0, then 
u € §1(R"). 


Proof. It suffices to show that 
(12.13) v(x) = sup{|y: * u(x)|: g € F,t > 1} 


belongs to L1(IR”). Clearly, v is bounded. Also, if supp u Cc {|z| < R}, then we 
can write u = S>O;u;, uj € L'(Br). Then 


(12.14) 
yr * u(x “or “pjeruy(e), Yee) = tj’ 2), ¥;(e) = O;9(@). 


If |z] = R+1+ p, then 7,1 * u;(x) = 0 fort < p, so 


(12.15) v(x) < Cp! S> M(u;)(2). 


J 
The weak (1,1) bound (12.7) on M now readily yields an L+-bound on v(2). 


One advantage of h'(IR”) is its localizability. We have the following useful 
result: 


Proposition 12.5. [fr > 0 and g € C"(R”) has compact support, then 

(12.16) u € h'(R") = gu € h'(R”). 

Proof. If g © C” and0 < r < 1, we have, for all y € Ff, 

(1217) [ge (guy(v) —gle)orxu(a)| sce” fh Ju(y)l ay 
Bi (ax) 


Hence it suffices to show that 


12. Hardy spaces 91 


(12.18) v(x) = sup t”™” / ju(y)| dy 


(12.19) v(a) < Y Ae hu) dy, 


where v(x) is the characteristic function of {|2|] < 1}, this is clear. 


Given 2 c R” open, u € Ly,.(Q), we say 
(12.20) UE Hige(Q) —> gue h'(R"), Vg € CHr(Q). 


This is equivalent to the statement that, for any compact K C Q, there isavu € 
§1(IR”) such that u = v ona neighborhood of K. To see this, note that if u € 
Hi,-(Q) and g € C§°(Q), g = 1 ona neighborhood of K, then gu € 61(R”). 
Now take v = gu +h, where h € C§°(IR”) has support disjoint from supp g, and 
[ h(x) dv = — f g(x)u(2z) da. By Proposition 12.4, v € §1(IR"). The converse 
is established similarly. 

Not every compactly supported element of L1(IR”) belongs to h!(IR”), but we 
do have the following. 


Proposition 12.6. [fp > 1 and u € L?(IR”) has compact support, then wu © 
5 (R"). 

Proof. We have 

(12.21) (OP f)(x) < Gf)(x) < CMf(a). 


Hence, given p > 1, u € L?(R") > Gu € L?(R"). Also, G°u has support in 
|jc| < R+1if supp u Cc {|x| < R}, s0Geu € L'(R"). 


The spaces $)'(IR”) and h1(IR") are Banach spaces, with norms 
(12.22) lulls: = [Gullzr, — [lellgs = [19° ull zs. 


It is useful to have the following approximation result. 
Proposition 12.7. Fix € C§°(R") such that f(x) dx = 1. Ifu € §'(R”), 
then 


(12.23) |e ¥U — Ullg1 20, as e > 0. 


Proof. One easily verifies from the definition that, for some C' < co, G(w. * u) 
(x) < CGu(x),V x,V  € (0, 1]. Hence, by the dominated convergence theorem, 
it suffices to show that 
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(12.24) G(wexu-—u)(%)—->0, ae.x, as e 0; 
that is, 


sup |(yr* Ye *U— Ye *Uu)(z)| 90, aex, ase. 
t>0,peF 


To prove this, it suffices to show that 


(12.25) lim sup sup (ve ke KU — Yt * u)(x)| =0, ae. a, 
€,0-0 Q<t<é yEeF 


and that, for each 6 > 0, 


(12.26) lim sup sup |(yr * He — Ye) * u(x) | = 0. 
e—0 4>5 wEeF 


In fact, (12.25) holds whenever x is a Lebesgue point for u (see the exercises for 
more on this), and (12.26) holds for all 2 € R”, since u € L1(IR”) and, for all 
yp € F, we have ||y; * We — Ys||[L~ < Cet-™-1. 


Corollary 12.8. Let Tyu(x) = u(x + y). Then, for u € 9'(R”), 

(12.27) \|Tyu — ull5: —> 0, as |y| > 0. 

Proof. Since ||7'\|c(51) = 1 for all y, it suffices to show that (12.27) holds for 
u in a dense subspace of $'(IR"). Thus it suffices to show that, for each e > 0, 


u€ §1(R"), 


(12.28) Jim, I|Ty (ve * u) — be ¥ ullg: =0. 
y i 


But Ty (We * u) — Pe * U= (Wey — We) * u, Where 
(12.29) Dey() — Pe(x) =e” [W(e“* (a + y)) — H(e“*2)]. 


Thus 


I|Ty (be *U) — Ye * Ull 52 


sup _ ||(Wey — Ve) * Ye * Ulf 
t>0,peF 


IPey — Pelle [lulls 
< Chyler" ull, 


(12.30) 


IA 


which finishes the proof. 


It is clear that we can replace $! by h! in Proposition 12.7 and Corollary 12.8, 
obtaining, for u € 1(R"), 


(12.31) Ile *u — Ullgs — 0, 
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as € — OQ, and 
(12.32) |Tyu — Ully2 — 0, 


as |y| > 0. 
We can also approximate by cut-offs: 


Proposition 12.9. Fix. € C§°(R"), so that x(a) = 1 for |x| <1, 0 for |x| > 2, 
and0 < x <1. Set xr(x) = x(x/R). Then, given u € h'(R"), we have 


(12.33) jim lu — XRU||p1 = 0. 


Proof. Clearly, G°(u — yRu)(x) = 0, for |x| < R—1, so 
jim G?(u—xRu)(z) =0, Va eR”. 


To get (12.33), we would like to appeal to the dominated convergence theorem. In 
fact, the estimates (12.17)-(12.19) (with g = 1 — yr) give 


(12.34) G?(u— xRu)(x) < Gu(x) + Av(x), VR>1, 


where A = ||Vx||z~, and v(z) is given by (12.19), with r = 1, sov € L1(R"). 
Thus dominated convergence does give 


(12.35) jim |G?(u—xrRu)||z1 = 0, 
and the proof is done. 
Together with (12.31), this gives 
Corollary 12.10. The space C§°(IR") is dense in h!(R"). 
A slightly more elaborate argument shows that 
(12.36) Do = {u € Co? (R"): fu dz = o} 
is dense in §'(R"); see [Sem]. 
One significant measure of how much smaller §!(R”) is than L1(IR”) is the 


following identification of an element of the dual of §1(IR”) that does not belong 
to L°(R”). 


Proposition 12.11. We have 


(12.37) [Fo log |z| dz] < Cl| fll 5- 


94 13. Function Space and Operator Theory for Nonlinear Analysis 


Proof. Let \(x) € C§°(R”) satisfy A(x) = 1 for |x| < 1, A(x) = 0 for |x] > 2. 
Set 


(12.38) é(x) = — D1 r?2) +5) \(1-A(27F2)). 


It is easy to check that 
(12.39) log |a| — (log 2)@(a”) € £°°(R”). 


Thus it suffices to estimate [ f(a)¢(x) dx. We have 
(12.40) | / f(a)t(x) de| < > | 7 F(a) A(2x) de 
j=-oo 


We claim that, for each 7 € Z, 


(12.41) | f #@)aex) da| <C2-" inf Gf. 


2=9 ) 


In fact, given 7 € Z, z € Bo-; (0), we can write 
(12.42) [ for rgia) dz = Ko, * f(z), 


with r = 2?-J, K = K(A,n), for some y € F; say (x) is a multiple of a 
translate of \(47). Consequently, with S; = Bz-;(0), we have 


(12.43) [fr@maalse > f or=cttiy. 


FEN Bia 
By Corollary 12.8, we have the following: 
Corollary 12.12. Given f € 91(R"), 
(12.44) log f € C(R”). 


The result (12.37) is a very special case of the fact that the dual of §1(IR") is 
naturally isomorphic to a space of functions called BMO(R"”). This was estab- 
lished in [FS]. The special case given above is the only case we will use in this 
book. More about this duality and its implications for analysis can be found in 
the treatise [S3]. Also, [S3] has other important information about Hardy spaces, 
including a study of singular integral operators on these spaces. 
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The next result is a variant of the div-curl lemma (discussed in Exercises for 
§ 6), due to [CLMS]. It states that a certain function that obviously belongs to 
L*(R") actually belongs to §!(IR”). Together with Corollary 12.12, this produces 
a useful tool for PDE. An application will be given in § 12B of Chap. 14. The proof 
below follows one of L. Evans and S. Muller, given in [Ev2]. 


Proposition 12.13. [fu € L?(R",R"), v € H}'(R"), and div u = 0, then 
u-Vu € 91(R”). 


Proof. Clearly, wu: Vu € L'(R”). Now, with y € C§°(IR”), supported in the unit 
ball, set y-(y) = r-"p(r-1(a — y)). We have 


(12.45) ic -Vv)pr dy = - / (uv — Ug,r)u+ Ver dy, 
B,(ax) 


since div u = 0. Thus, with Cp = ||Vy||z-~, 


Ci 
(12.46) [tu -Vv)0r dy| < = / |u — Ve,r| - |ul dy. 
B,(a) 
Take 
2n p 
(12.47) pe (2, —".): pts @ (1,5), 
n—2 p-l1 
Then 
1/p 1/q 
Co 
| fu: Voyer dy] <2 [fe joven” ay jul? dy 
r 
B,(a) B,(a) 
1/p 1/q 
Co 
(12.48) < a |Vo|? dy |u|? dy ; 
B,(x) B,(«) 


where p = pn/(p+n) < 2 anda =n +1. Consequently, 
| [Veer dy] < Cone (ioye) 2M (Ie) 
(12.49) < Co{M(|Vo]?)”/? + M (|ul?) 7/7}. 


By Corollary 12.3, we have |M(Val?) || 270 < Cll Vol? || 2/05 and so 
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2 
if auiver) ? der < c| |Vol? da. 


Similarly, 
if aiuiny?/ dx < cf jul? de. 


Hence 


C(|Vu|? + lullz2)- 


(12.50) |ju-Vullg. = sup | fi (u-Vu)y, 
yEeF,r>0 


We next establish a localized version of Proposition 12.13. 


Proposition 12.14. Let Q C R®” be open. If u € L?(Q,R"), divu = 0, and 
v € H'(Q), thenu- Vv € ;,(Q). 


Proof. We may as well suppose n > 1. Take any O C Q, diffeomorphic to a 
ball. It suffices to show that u - Vv is equal on © to an element of 1(R”). Say 
Occ U cc Q, with U also diffeomorphic to a ball. Pick y € CS°(U), x =1 
on O. 

Let @ € L?(Q,A"~') correspond to wu via the volume element on 2. Then 
dit = 0. We use the Hodge decomposition of L?(U, A”~'), with absolute bound- 
ary condition: 


(12.51) a = ddG4% + 5dG4i+ PAG on u. 


Since du = 0, we have by (9.48) of Chap. 5 that 6dG4% = 0. Also, given n > 1, 
H"-!(U) = 0, so PAG = 0, too, and so 


(12.52) ai=—db, we H(U,A"”?). 


Having this, we define a vector field up) on R” so that tp = d(xw), and we 
set Up = Xv. It follows that uo, vo satisfy the hypotheses of Proposition 12.13, so 
uo: Vuo € 91(R"). But uo - Vuo = u- Vu on O, so the proof is done. 


Let us finally mention that while we have only briefly alluded to the space 
BMO, it has also proven to be of central importance, especially since the work 
of [FS]. More about the role of BMO in paradifferential operator calculus can 
be found in [T2]. Also, Proposition 12.13 can be deduced from a commutator 
estimate involving BMO, as explained in [CLMS]; see also [AT] and [T4]. 


Exercises 
We say x € R” is a Lebesgue point for f € L*(IR”) provided 


5 f uw — f(2)| dy =0. 


” aes 


_ vol(B) 
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In Exercises 1 and 2, we establish that, given f € L*(R"), ae. 2 € R” is a Lebesgue 
point of f. 
1. Set 


2. Given » > 0, let 
ny: 1 
By ={o eR" :limsup ey f If) - f@lav > a}. 
B,(z) 


Take € > 0, and take g € C§°(R”) so that || f — g|| 1 < €. Show that Fy is unchanged 
if f is replaced by f — g. Deduce that 


1 1 
By c {ai M(t o)(e)> Sabu ta: ite) aa > 5a}, 
and hence, via Proposition 12.1, 


meas(Ex) < Sif - alla <<. 


Deduce that meas(/,) = 0,V A > 0, and hence a.e. x € R” is a Lebesgue point for f. 


3. Now verify that (12.25) holds whenever x is a Lebesgue point of w. 
4. If u: R? > R?, show that 


u € H'(R?) = det Du € §'(R’). 


(Hint: Compute div w, when w = (Oyu1, —Ozru2).) 
5. Ifu: R? — R®, show that 


u € H'(R?) = us X Uy € 9'(R’). 


(Hint. Show that the first argument of uz X uy is det Dv, where v = (w2, u3).) 


A. Variations on complex interpolation 


Let X and Y be Banach spaces, assumed to be linear subspaces of a Hausdorff 
locally convex space V (with continuous inclusions). We say (X,Y, V) is acom- 
patible triple. For 6 € (0,1), the classical complex interpolation space [X,Y Jo, 
introduced in Chap. 4 and much used in this chapter, is defined as follows. First, 
Z=X+/Y gets anatural norm; forv € X + Y, 


(A.1) llu||z = inf {|v ||x + ||vally 1U= U1 + V2, ULE X, V2 E Y}. 
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One has X + Y = X @ Y/L, where L = {(v,-v) : « © X NY} is a closed 
linear subspace, so X + Y is a Banach space. Let Q = {z € C:0 < Rez < 1}, 
with closure 2. Define Hg(X,Y) to be the space of functions f : OQ + Z = 
X + Y, continuous on 2, holomorphic on Q (with values in X + Y), satisfying 
f : {Imz = 0} > X continuous, f : {Im z = 1} > Y continuous, and 


(A.2) lu(zIl2<C, llulinIlx <C, llul+ia)lly <C, 


for some C' < co, independent of z € Q and y € R. Then, for 6 € (0,1), 


(A.3) [X,Y]o = {u(@) : ue Ha(X,Y)}. 
One has 
(A.4) [X,Y]o © He(X, Y)/{u € Ha(X,Y) : u(@) = OF," 


giving [X, Y]g the sructure of a Banach space. Here 


(AS) — |lullocx,y) = sup |lu(z)||z + sup ||u(iy)||x + sup ||u(1 + zy) |ly- 
zEQ) ¥y y 


If J is an interval in R, we say a family of Banach spaces X,, s € I (subspaces 
of V) forms a complex interpolation scale provided that for s,t € I, 6 € (0,1), 


(A.6) [Xs, Xt]o = X(1~0)s+6¢- 


Examples of such scales include L?-Sobolev spaces X, = H*?(M), s € R, 
provided p € (1,00), as shown in § 6 of this chapter, the case p = 2 having been 
done in Chap. 4. It turns out that (A.6) fails for Zygmund spaces X, = C#(M), 
but an analogous identity holds for some closely related interpolation functors, 
which we proceed to introduce. 

If (X, Y, V) is acompatible triple, as defined in above, we define Ha(X, Y, V) 
to be the space of functions u : Q — X + Y = Z such that 


(A.7) u : Q —+ Z is holomorphic, 

(A.8) lu(allz2<C, |luliy)Ilx <C, llul+iy)lly <C, 
and 

(A.9) u:Q —> V is continuous. 


For such u, we again use the norm (A.5). Note that the only difference with 


Ho(X,Y) is that we are relaxing the continuity hypothesis for u on 2. 
Ha(X, Y,V) is also a Banach space, and we have a natural isometric inclusion 


(A.10) Ho(X,Y) O Ha(X,Y,V). 
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Now for @ € (0,1) we set 
(A.11) [X,Y]o.v = {u(9) : u € He(X,Y,V)}. 
Again this space gets a Banach space structure, via 
(A.12) [X,Ylov © Ha(X,Y,V)/{u € Hoal(X, Y,V) : u(9) = Of, 
and there is a natural continuous injection 
(A.13) [X,Y]o @ [X,Y ]o.v. 


Sometimes this is an isomorphism. In fact, sometimes [X,Y]o = [X,Y ]o.v for 
practically all reasonable choices of V. For example, one can verify this for X = 
L?(R”), Y = H*?(R"), the L?-Sobolev space, with p € (1,00),s € (0,00). 
On the other hand, there are cases where equality in (A.10) does not hold, and 
where |X, Y]o-v is of greater interest than [X, Y]o. 

We next define [X,Y]. In this case we assume X and Y are Banach spaces 
and Y C X (continuously). We take 2 as above, and set Q = {zEeC:0< 
Re z < 1}, ie., we throw in the right boundary but not the left boundary. We then 
define H2,(X, Y) to be the space of functions w : Q + X such that 


u:Q—+ X is holomorphic, 
(A.14) lu(z)Ilx $C, lu +iy)lly <C, 


u: 9 —> X is continuous. 


Note that the essential difference between Ha(X,Y) and the space we have 
just introduced is that we have completely dropped any continuity requirement 
at {Re z = 0}. We also do not require continuity from {Rez = 1} to Y. The 
space H2,(X,Y) is a Banach space, with norm 


(A.15) lull, (x,¥) = sup |u(z)||x + sup ||u(1 + zy)|ly. 
zEQ y 


Now, for @ € (0,1), we set 
(A.16) [X,Y]g = {u(0) :u€ Ho(X,Y)}, 


with the same sort of Banach space structure as arose in (A.4) and (A.12). We 
have continuous injections 


(A.17) [X,Y]o © [X,Ylox © [X,Y]}. 


Our next task is to extend the standard result on operator interpolation from the 
setting of [X, Y]o to that of [X, Y]o.v and [X, Y]5. 
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Proposition A.1. Let (X;,Y;,V;) be compatible triples, 7 = 1,2. Assume that 
T : Vi — Vo is continuous and that 


(A.18) Pe X= ke, iG 


continuously. (Continuity is automatic, by the closed graph theorem.) Then, for 
each 0 € (0,1), 


(A.19) T : [X1,VYilo;v, — [X2, Yalo;ve- 


Furthermore, if Y; C Xj (continuously) and T is a continuous linear map satis- 
fying (A.18), then for each 6 € (0,1), 


(A.20) T : [X1,Yilg — [Xo, Yo]p. 


Proof. Given f € [X1, Yalo.v, pick u € Ho(X1,%1,Vi) such that f = u(6). 
Then we have 


(A.21) T : Ho(X1,%1,Vi) 4 Hol(Xe, Yo, Va), (Tu)(z) = Tu(z), 
and hence 

(A.22) Tf = (Tu)(9) € [X2, Yalo.ve- 

This proves (A.19). The proof of (A.20) is similar. 


Remark: In case V = X + Y, with the weak topology, [X,Y ]9.y is what is 
denoted (X,Y) in [JJ], and called the weak complex interpolation space. 


Alternatives to (A.6) for a family X,, of Banach spaces include 


(A.23) [Xs, Xt]o,v = X(1-6)s+0t 
and 
(A.24) [X5,X1]8 = X(eys+ot- 


Here, as before, we take 6 € (0,1). It is an exercise, using results of § 6, to show 
that both (A.23) and (A.24), as well as (A.6), hold when X, = H*?(M), given 
p € (1,00), where M can be R” or a compact Riemannian manifold. We now 
discuss the situation for Zygmund spaces. 

We start with Zygmund spaces on the torus T”. We recall from § 8 that the 
Zygmund space C’(T”) is defined for r € R, as follows. Take y € C§°(R”), 
radial, satisfying y(€) = 1 for |E| < 1. Set y,(€) = y(27*E). Then set wo = ¥, 
Uk = Yr — Pr—-1 fork EN, so {Wx : k > 0} is a Littlewood—Paley partition of 
unity. We define C7 (T”) to consist of f € D’(T”) such that 
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(A.25) If llce = sup 2*" |e (D) fll b° < 00. 
k>0 


With A = (I — A)!/? and s,t € R, we have 

(A.26) re) 0), 

By material developed in § 8, 

(A.27) re Rt\Zt = CT") =C'(T"), 

where, if r = k + a with k € Zt and0 < a < 1, C"(T”) consists of functions 


whose derivatives of order < & are Hélder continuous of exponent a. 
We aim to show the following. 


Proposition A.2. Ifr <s <tand0 < @ <1, then 


(A.28) ereluga Cc (T” )Jo.orcan) = Cer ig), 
and 
(A.29) (ea), ry =e a: 


Proof. First, suppose f € [C3,C%]o.cr, so f = u(0) for some u € Ho(C%, 
Ct, C®). Then consider 


(A.30) u(z) = e® AC5)7 ASu(z), 

Bounds of the type (A.8) on u, together with (8.13) in the torus setting, yield 
(A31) llo(iy)lloo, v1 + iy)lloe <C, 

with C independent of y € R. In other words, 

(A.32) lIYe(D)v(z) |r $C, Rez =0,1, 


with C independent of Im z and k. Also, for each k € Zt, ~x(D)v : Q > 
L*°(T”) continuously, so the maximum principle implies 


(A.33) IlWi(D) AC 97 AS fll p20 < C, 


independent of k € Z*. This gives AC-%s+9 Ff € C®, hence f € 
Coe Ts: 
Second, suppose f € Coen Te, Set 
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(A.34) u(z) =e? AG-2)G-s) F. 

Then u(6) = e® f. We claim that 

(A.35) u € He(C2,C*, C7), 


as long as r < s < t. Once we establish this, we will have the reverse containment 
in (A.28). Bounds of the form 


(A.36) Ilu(z)| 


as $C, lu t+wy)lle <¢ 


follow from (8.13), and are more than adequate versions of (A.8). It remains to 
establish that 


(A.37) u:Q—+ Cl(T"), continuously. 


Indeed, we know u : 2 + C$(T") is bounded. It is readily verified that 


(A.38) u:Q—+D'(T"), continuously, 
and that 
(A.39) r<s= > C{(T") 6 Cl(T") is compact. 


The result (A.37) follows from these observations. Thus the proof of (A.28) is 
complete. 

We turn to the proof of (A.29). If u € H2,(C8, C!), form u(z) as in (A.30), 
and for e € (0, 1] set 
(A.40)  u-(z) =e *Av(z), ve: Q> C°®(T”) bounded and continuous 
(with bound that might depend on ¢). We have 
(AAl) by (D)ve(e + iy) = e+)" oy, (D)e~ AAC Att—5)¥ AS uz). 


Now {A‘u(z) : z € Q} is bounded in C°(T”), and the operator norm of Ai(¢~s)Y 
on C2(T”) is exponentially bounded in |y|. We have 


(A.42) {e~*4AeC—5) : 0 << 1} bounded in OPS! ,(T”), 
hence bounded in operator norm on C2(T”). We deduce that 
(A.43) Ilpn(D)ve(e + ty)llz~ < C, 


independent of y € R and ¢ € (0, 1]. The hypothesis on wu also implies 
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(A.44) Ilex (D)ve(1 + iy) |I|b~ SC, 


independent of y € R and ¢ € (0, 1]. Now the maximum principle applies. Given 
6 € (0,1), 


(A.45) Ive(D)e- (6) b=» < C, 


independent of ¢. Taking « \, 0 yields v(@) € C®(T”), hence u(e) € 
el ame as (T”) : 

This proves one inclusion in (A.29). The proof of the reverse inclusion is sim- 
ilar to that for (A.28). Given f € Cl'~98*"('T”), take u(z) as in (A.34). The 
claim is that u € H2,(C%,C‘). We already have (A.36), and the only thing that 
remains is to check that 


(A.46) u:Q —+ C8(T”) continuously, 


and this is straightforward. (What fails is continuity of u : Q > C5(T*) at the 
left boundary of Q.) 


Remark: In contrast to (A.28)-(A.29), one has 
(AAT) [C8(T"), C2(T”)]9 = closure of C°(T”) in CUT"). 
Related results are given in [Tri]. 


If OPS{'9(T") denotes the class of pseudodifferential operators on T” with 
symbols in ST", then for all s,m € R, 


(A.48) P€ OPS7,(I") = P202(T) 4+ C-"(0"). 


Cf. Proposition 8.6. Using coordinate invariance of OPS7"y and of C’(T”) for 
r € Rt \ Z*, we deduce invariance of C$(T") under diffeomorphisms, for all 
seER. 

From here, we can develop the spaces C$ (1/4) on a compact Riemannian man- 
ifold M and the spaces C$(M/) on a compact manifold with boundary. These 
developments are done in § 8. 
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Nonlinear Elliptic Equations 


Introduction 


Methods of the calculus of variations and other analytical techniques, applied to 
problems in geometry and classical continuum mechanics, often lead to elliptic 
PDE that are not linear. We discuss a number of examples and some of the devel- 
opments that have arisen to treat such problems. 

The simplest nonlinear elliptic problems are the semilinear ones, of the form 
Lu = f(x,D™~1u), where L is a linear elliptic operator of order m and the 
nonlinear term f(a, D’~1u) involves derivatives of u of order < m —1.In§1 
we look at semilinear equations of the form 


(0.1) Au = f(a,u), 


on a compact, Riemannian manifold M, with or without boundary. The Dirichlet 
problem for (0.1) is solvable provided 0, f(x,u) > 0 if each connected com- 
ponent of M has a nonempty boundary. If 17 has no boundary, this condition 
does not always imply the solvability of (0.1), but one can solve this equation if 
one requires f(x, wu) to be positive for u > a, and negative for u < ao. We use 
three approaches to (0.1): a variational approach, minimizing a function defined 
on a certain function space, the “method of continuity,” solving a one-parameter 
family of equations of the type (0.1), and a variant of the method of continuity 
that involves a Leray—Schauder fixed-point theorem. This fixed-point theorem is 
established in Appendix B, at the end of this chapter. 
A particular example of (0.1) is 


(0.2) Au = k(x) — K(x)e™, 


which arises when one has a 2-manifold with Gauss curvature k(x) and wants to 
multiply the metric tensor by the conformal factor e?“ and obtain K(x) as the 
Gauss curvature. The condition 0,,f(x,u) > 0 requires that A(x) < 0 in (0.2). 
In § 2 we study (0.2) on a compact, Riemannian 2-fold without boundary, given 
K(a) < 0. The Gauss—Bonnet formula implies that y(/) < 0 is a necessary 
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condition for solvability in this case; the main result of § 2 is that this is also a 
sufficient condition. When you take kK = —1, this establishes the uniformization 
theorem for compact Riemann surfaces of negative Euler characteristic. When 
x(M) = 0, one takes K = 0 and (0.2) is linear. The remaining case of this uni- 
formization theorem, y(/) = 2, is treated by a different method in Proposition 
2.9. We also mention alternative treatments of the uniformization theorem, for (/ 
homeomorphic to T? or S$, via the Riemann-Roch theorem, in Chap. 10, § 9. 

The next topic is local solvability of nonlinear elliptic PDE. We establish this 
via the inverse function theorem for C!-maps on a Banach space. We treat under- 
determined as well as determined elliptic equations. We obtain solutions in § 3 
with a high but finite degree of regularity. In some cases such solutions are actu- 
ally C'°°. In §4 we establish higher regularity for solutions to elliptic PDE that 
are already known to have a fair amount of smoothness. This result suffices for 
applications made in § 3, though PDE encountered further on will require much 
more powerful regularity results. 

In § 5 we establish the theorem of J. Nash, on isometric imbeddings of compact 
Riemannian manifolds in Euclidean space, largely following the ingenious simpli- 
fication of M. Giinther [Gu1], allowing one to apply the inverse function theorem 
for C!-maps on a Banach space. Again, the regularity result of § 4 applies, allow- 
ing one to obtain a C™-isometric imbedding. 

In § 6 we introduce the venerable problem of describing minimal surfaces. We 
establish a number of classical results, in particular the solution to the Plateau 
problem, producing a (generalized) minimal surface, as the image of the unit disc 
under a harmonic and essentially conformal map, taking the boundary of the disc 
homeomorphically onto a given simple closed curve in R”. 

In §7 we begin to study the quasi-linear elliptic PDE satisfied by a function 
whose graph is a minimal surface. We use results of § 6 to establish some results 
on the Dirichlet problem for the minimal surface equation, and we note several 
questions about this Dirichlet problem whose solutions are not simple conse- 
quences of the results of § 6, such as boundary regularity. These questions serve as 
guides to the results of boundary problems for quasi-linear elliptic PDE derived 
in the next three sections. 

In § 8 we apply the paradifferential operator calculus developed in Chap. 13, 
§ 10, to obtain regularity results for nonlinear elliptic boundary problems. We 
concentrate on second-order PDE (possibly systems) on a compact manifold with 
boundary M and obtain higher regularity for a solution u, assumed a priori to 
belong to C?2+”(M), for some r > 0, for a completely nonlinear elliptic PDE, or 
to C1+"(M), in the quasi-linear case. To check how much these results accom- 
plish, we recall the minimal surface equation and note a gap between the regularity 
of a solution needed to apply the main result (Theorem 8.4) and the regularity a 
solution is known to possess as a consequence of results in § 7. 

Section 9 is devoted to filling that gap, in the scalar case, by the famous 
DeGiorgi—Nash—Moser theory. We follow mainly J. Moser [Mo2], together with 
complementary results of C. B. Morrey on nonhomogeneous equations. Morrey’s 
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results use spaces now known as Morrey spaces, which are discussed in Appendix 
A at the end of this chapter. 

With the regularity results of §§ 8 and 9 under our belt, we resume the study of 
the Dirichlet problem for quasi-linear elliptic PDE in the scalar case, in § 10, with 
particular attention to the minimal surface equation. We note that the Dirichlet 
problem for general boundary data is not solvable unless there is a restriction on 
the domain on which a solution u is sought. This has to do with the fact that the 
minimal surface equation is not “uniformly elliptic.’ We give examples of some 
uniformly elliptic PDE, modeling stretched membranes, for which the Dirichlet 
problem has a solution for general smooth data, on a general, smooth, bounded 
domain. We do not treat the most general scalar, second-order, quasi-linear elliptic 
PDE, though our treatment does include cases of major importance. More material 
can be found in [GT] and [LU]. 

In § 11 we return to the variational method, introduced in § 1, and prove that a 
variety of functionals 


(0.3) I(u) = | F(a,u, Vu) dV(z) 
{ 


possess minima in sets 
(0.4) V ={ue HQ): u=gon AQ}. 


The analysis includes cases both of real-valued u and of u taking values in RY. 
The latter case gives rise to N x N elliptic systems, and some regularity results 
for quasi-linear elliptic systems are established in § 12. Sometimes solutions are 
not smooth everywhere, but we can show that they are smooth on the complement 
of a closed set % C (Q of Hausdorff dimension < n — 2 (n = dim Q)). Results of 
this nature are called “partial regularity” results. 

In § 13 we establish results on linear elliptic equations in nondivergence form, 
due to N. Krylov and M. Safonov, which take the place of DeGiorgi-Nash—Moser 
estimates in the study of certain fully nonlinear equations, done in § 14. In § 15 
we apply this to equations of the Monge—Ampere type. 

In § 16 we obtain some results for nonlinear elliptic equations for functions of 
two variables that are stronger than results available for functions of more vari- 
ables. 

In 817 we consider overdetermined nonlinear elliptic systems. We derive inte- 
rior regularity results, and illustrate these results with several examples. For one, 
we show how a metric tensor-preserving diffeomorphism between two Rieman- 
nian manifolds satisfies an overdetermined elliptic system. For another example, 
we consider how the Wey] tensor yields an overdetermined elliptic system for the 
conformally normalized metric tensor, when one uses n-harmonic coordinates. 
We also discuss applicability of these regularity results to the study of exterior 
differential systems. 
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At the end are several appendices. Appendix A discusses Morrey spaces. 
Appendix B establishes some Leray-Schauder fixed point theorems. Appendix 
C introduces the Weyl tensor, and derives its conformal invariance. 

One attack on second-order, scalar, nonlinear elliptic PDE that has been very 
active recently is the “viscosity method.” We do not discuss this method here; one 
can consult the review article [CIL] for material on this. 


1. Acclass of semilinear equations 
In this section we look at equations of the form 
(1.1) Au = f(x,u) onM, 


where M/ is a Riemannian manifold, either compact or the interior of a compact 
manifold M with smooth boundary. We first consider the Dirichlet boundary 
condition 


(1.2) Won — 9% 


where M is connected and has nonempty boundary. We suppose f € C°(M xR). 
We will treat (1.1)—(1.2) under the hypothesis that 


Of 


(1.3) a 0. 


Other cases will be considered later in this section. Suppose F(z,u) = 


je f(x, 8) ds, so 
Then (1.3) is the hypothesis that F'(«, u) is a convex function of u. Let 
1 2 
(1.5) I(u) = 5 llaullz cary + | F(a,u(x)) dV(z). 
M 
We will see that a solution to (1.1)—-(1.2) is a critical point of J on the space of 
functions u on M/, equal to g on OM. 
We will make the following temporary restriction on F’: 


(1.6) For |u| > K, 0, f(x, u) = 0, 


so F(x, u) is linear in u for u > K and for u < —K. Thus, for some constant L, 
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(1.7) |O, F(z,u)|<L onM xR. 
Let 
(1.8) V ={ue H'(M):u=gondM}. 


Lemma 1.1. Under the hypotheses (1.3)-(1.7), we have the following results 
about the functional I: V — R: 


(1.9) I is strictly convex; 


(1.10) I is Lipschitz continuous, 


with the norm topology on V ; 


(1.11) I is bounded below; 
and 
(1.12) I(v) > +00, as ||v|| 71 3 co. 


Proof. (1.9) is trivial. (1.10) follows from 

(1.13) |F(x,u) — F(a,v)| < Llu — vl, 

which follows from (1.7). The convexity of F(a, u) in u implies 
(1.14) F(a,u) > —Colul — C1. 

Hence 


ai 
T(u) > 5\|dull? — Collullzs — Ci 


(1.15) i i 
2 qiidullie = 5 Bllullie — Cllullz2 — C, 
since 
1 
(1.16) 5 lidull ze > Bllullz2 —C”, foru eV. 


The last line in (1.15) clearly implies (1.11) and (1.12). 


Proposition 1.2. Under the hypotheses (1.3)-(1.7), I(u) has a unique minimum 
onvV. 


Proof. Let ap = infy I(u). By (1.11), ao is finite. Pick R such that K = VM 
Br(0) 4 0, where Br(0) is the ball of radius R centered at 0 in H'(M), and 
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such that ||| 71 > R = I(u) > ap +1, which is possible by (1.12). Note that 
K is aclosed, convex, bounded subset of H'(/). Let 


(1.17) Ke = {we K: ap < I(u) < ag + ¢}. 


For each e > 0, KK is aclosed, convex subset of K. It follows that Kz is weakly 
closed in ’, which is weakly compact. Hence 


(1.18) () Ke = Ko #0. 


e>0 


Now inf I(w) is assumed on Ko. By the strict convexity of (uw), Ko consists of 
a single point. 


If wis the unique point in Ky andv € C§°(M), then u+sv € V, forall s € R, 
and I(u + sv) is a smooth function of s which is minimal at s = 0, so 


(1.19) O= < T(ut+ au)| 4 = (—Au,v) + / f (x, u(x)) v(x) dV(z). 
M 


Hence (1.1) holds. We have the following regularity result: 


Proposition 1.3. For k = 1,2,3,..., if g € H*®+'/?(QM), then any solution 
u € V to (1.1)-(1.2) belongs to H**1(M). Hence, if g € C°(OM), then u € 
C™(M). 


Proof. We start with u € H'(M). Then the right side of (1.1) belongs to H!(M) 
if f(a,u) satisfies (1.6). This gives u € H?(M), provided g € H*/?(0M). 
Additional regularity follows inductively. 


We have uniqueness of the element uw € V minimizing J(u), under the 
hypotheses (1.3)-(1.7). In fact, under the hypothesis (1.3), there is uniqueness 
of solutions to (1.1)-(1.2) which are sufficiently smooth, as a consequence of the 
following application of the maximum principle. 


Proposition 1.4. Let u and v € C?(M)M C(M) satisfy (1.1), with u = g and 
v = honOM. If (1.3) holds, then 


(1.20) sup (u—v) < sup (g—h) V0, 
M aM 


where a V b = max(a, b) and 


(1.21) sup |u — v| < sup |g — Al. 
M aM 
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Proof. Let w = u — v. Then, by (1.3), 
(1.22) Aw=X(r)w, wy, =9-A; 


where 
A(2) = f(a, u) 7 f(z, v) > 0. 
u—wv 
IfO = {% € M: w(x) > 0}, then Aw > 0 on O, so the maximum principle 
applies on O, yielding (1.20). Replacing w by —w gives (1.20) with the roles of 
u and v, and of g and h, reversed, and (1.21) follows. 


One application will be the following first step toward relaxing the hypothesis 
(1.6). 


Corollary 1.5. Let f(z,0) = v(x) € C*(M). Take g € C™*(OM), and let 
® € C~(M) be the solution to 


(1.23) A®=yonM, ®=gon0M. 


Then, under the hypothesis (1.3), a solution u to (1.1)-(1.2) satisfies 


(1.24) sup u<sup ®+ (sup (—®) Vv 0) 
M M M 
and 
(1.25) sup |u| <sup 2|®|. 
M M 


Proof. We have 


with A(x) = [f(a,u) — f(#,0)]/u > 0. Thus A(u- ©) >OonO={eeEM: 
u(x) > O}, so 


sup (u — ®) = sup (u — ®) < sup (—®) V0. 
6) 30 M 


This gives (1.24). Also A(® — u) > OonO7 = {2 € M: u(z) < O}, so 


sup (® — u) = sup (® — u) < sup ®V 0, 
o- ao- M 


which together with (1.24) gives (1.25). 


We can now prove the following result on the solvability of (1.1)-(1.2). 
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Theorem 1.6. Suppose f(x, u) satisfies (1.3). Given g © C~(OM), there is a 
unique solution u € C®(M) to (1.1)+(1.2). 


Proof. Let f;(x,u) be smooth, satisfying 
(1.27) fi(z,u) = f(z,u), for |u| <j, 


and be such that (1.3)-(1.7) hold for each f;, with kK = K,;. We have solutions 
uj € C™(M) to 


(1.28) Au; = fj (2, uj), Uslonr = 9: 


Now f;(x,0) = f(x,0) = y(a), independent of j, and the estimate (1.25) holds 
for all u;, so 


(1.29) sup |u;| < sup 2/9], 
M M 


where ® is defined by (1.23). Thus the sequence (u,;) stabilizes for large 7, and 
the proof is complete. 


We next discuss a geometrical problem that can be solved using the results 
developed above. A more substantial variant of this problem will be tackled in 
the next section. The problem we consider here is the following. Let M be a 
connected, compact, two-dimensional manifold, with nonempty boundary. We 
suppose that we are given a Riemannian metric g on M, and we desire to construct 
a conformally related metric whose Gauss curvature (x) is a given function on 
M. As shown in (3.46) of Appendix C, if k(a) is the Gauss curvature of g and if 
g' = e?“g, then the Gauss curvature of g’ is given by 


(1.30) K (a) = (-Au + k(a))e7™, 
where A is the Laplace operator for the metric g. Thus we want to solve the PDE 
(1.31) Au = k(x) — K(ax)e" = f(z,u), 


for u. This is of the form (1.1). The hypothesis (1.3) is satisfied provided 
K(a) < 0. Thus Theorem 1.6 yields the following. 


Proposition 1.7. If M is a connected, compact 2-manifold with nonempty bound- 
ary OM, g a Riemannian metric on M, and K € C®(M) a given function 


satisfying 


(1.32) K(x) <OonM, 
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then there exists u € C°(M) such that the metric g' = e2“g conformal to g has 
Gauss curvature K. Given any v € C®(0M), there is a unique such u satisfying 
u=vonoM. 


Results of this section do not apply if (2) is allowed to be positive some- 
where; we refer to [KaW] and [Kaz] for results that do apply in that case. 

If one desires to make (MM, g) conformally equivalent to a flat metric, that is, 
one with K(x) = 0, then (1.31) becomes the linear equation 


(1.33) Au = k(2). 


This can be solved whenever M is connected with nonempty boundary, with 
u prescribed on OM. As shown in Proposition 3.1 of Appendix C, when the 
curvature vanishes, one can choose local coordinates so that the metric tensor 
becomes 6;;. This could provide an alternative proof of the existence of local 
isothermal coordinates, which is established by a different argument in Chap. 5, 
§ 10. However, the following logical wrinkle should be pointed out. The deriva- 
tion of the formula (1.30) in § 3 of Appendix C made use of a reduction to the case 
Ik = e2¥ jk and therefore relied on the existence of local isothermal coordinates. 
Now, one could grind out a direct proof of (1.30) without using this reduction, thus 
smoothing out this wrinkle. 

We next tackle the equation (1.1) when M is compact, without boundary. For 
now, we retain the hypothesis (1.3), 0f/Ou > 0. Without a boundary for M, 
we have a hard time bounding u, since (1.16) fails for constant functions on M/. 
In fact, the equation (1.31) cannot be solved when K(x) = —1, k(x) = 1, and 
M = S?, so some further hypotheses are necessary. We will make the following 
hypothesis: For some a; € R, 


(1.34) Uu<ao=> f(x,u) <0, u>a=> f(x,u) > 0. 


If Of /Ou > 0, this is equivalent to the existence of a function u = y(a) such that 
f (a, y(x)) = 0. We see how this hypothesis controls the size of a solution. 


Proposition 1.8. [fu solves (1.1) and M is compact, then 
(1.35) ag < u(x) <a, 
provided (1.34) holds. 


Proof. If u is maximal at xo, then Au(29) < 0, so f(29,u(xo)) < 0, and so 
(1.34) implies u < a,. The other inequality in (1.35) follows similarly. 


To get an existence result out of this estimate, we use a technique known as 
the method of continuity. We show that, for each + € [0,1], there is a smooth 
solution to 
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(1.36) Au = (1—7T)(u—6)+7f(a,u) = f-(2,u), 


where we pick b = (ag + a1)/2. Clearly, this equation is solvable when 7 = 0. 
Let J be the largest interval in [0, 1], containing 0, with the property that (1.36) is 
solvable for all 7 € J. We wish to show that J = [0,1]. First note that, for any 
7 € [0,1], 


(1.37) u<ao => f-(a,u) <0, u>a=> f-(z,u) > 0, 
so any solution u = u, to (1.36) must satisfy 


(1.38) ag < u,(x) < ay. 


Using this, we can show that J is closed in [0,1]. In fact, let u; = u,, solve 
(1.36) for 7; € J, 7; 7 o. We have |lu;||z~0 < a@ < oo by (1.38), so 
gj(z) = f,,(x,uj(x)) is bounded in C(M). Thus elliptic regularity for the 
Laplace operator yields 


(1.39) \|u5 llor~) < br < 00, 


for any r < 2. This yields a C’-bound for g;, and hence (1.39) holds for any 
r <4, Iterating, we get u; bounded in C™(M). Any limit point u € C°(M) 
solves (1.36) with T = o, so J is closed. 

We next show that J is open in [0,1]. That is, if 77 € J, 7 < 1, then, for some 
€ > 0, [7,7 + ¢€) C J. To do this, fix & large and define 


(1.40) W: [0,1] x H*(M)—+ H*-*(M), W(7,u) = Au— f,(a,u). 
This map is C', and its derivative with respect to the second argument is 
(1.41) D2V(1,u)u = Ly, 


where 
L:H*(M) — H*-7(M) 


is given by 
(1.42) Iv = Av—A(a)v, A(x) =1—7+700uf (az, u). 


Now, if f satisfies (1.3), then A(z) > 1 — 7, which is > 0 if 7 < 1. Thus L is 
an invertible operator. The inverse function theorem implies that U(7,u) = 0 is 
solvable for |r — T)| < €. We thus have the following: 


Proposition 1.9. If M is a compact manifold without boundary and if f(x, u) 
satisfies the conditions (1.3) and (1.34), then the PDE (1.1) has a smooth solution. 
If (1.3) is strengthened to 0, f(x, u) > 0, then the solution is unique. 
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The only point left to establish is uniqueness. If wu and v are two solutions, then, 
as in (1.22), we have for w = u — v the equation 


Aw =X(z)w, A(x) = [f(z,u) — f(z, v)]/(u—v) > 0. 


Thus 
—||Vull22 = / A(x) |w(2)|? aV, 


which implies w = 0 if A(x) > 0 on M. 

Note that if we only have A(z) > 0, then w must be constant (if M is 
connected), and that constant must be 0 if A(~) > O on an open subset 
of MM, so cases of nonuniqueness are rather restricted, under the hypotheses 
of Proposition 1.9. The reader can formulate further uniqueness results. 

It is possible to obtain solutions to (1.1) without the hypothesis (1.3) if we 
retain the hypothesis (1.34). To do this, first alter f(a,u) on u < ao and on 
u > a, toa smooth g(x, uw) satisfying g(z,u) = —Ko < 0 for u < ao — 6 and 
g(x,u) = Kk, > 0 for u > a, + 6, where 6 is some positive number. We want to 
show that, for each 7 € [0, 1], the equation 


(1.43) Au = (1—7)(u— 6) + T9(a2, u) = gr(a, u) 

is solvable, with solution satisfying (1.38). Convert (1.43) to 

(1.44) u = (A-1)7'(g-(a,u) — u) = ®,(u). 

Now each ®, is a continuous and compact map on the Banach space C'(M): 
(1.45) ®,:C(M)— C(M), 


with continuous dependence on T. For solvability we can use the Leray-Schauder 
fixed-point theorem, proved in Appendix B at the end of this chapter. Note that 
any solution to (1.44) is also a solution to (1.43) and hence satisfies (1.38). In 
particular, 


(1.46) u=®,(u) = llullocw < A = max(|ao|, |ai|). 


Since o(u) = —(A — 1)~1b = 6, which is independent of wu, it follows from 
Theorem B.5 that (1.44) is solvable for all 7 € [0,1]. We have the following 
improvement of Proposition 1.9. 


Theorem 1.10. Jf M is a compact manifold without boundary and if the func- 
tion f(x, u) satisfies the condition (1.34), then the equation (1.1) has a smooth 
solution, satisfying ay < u(x) < ay. 


The equation (1.31) for the conformal factor needed to adjust the curvature of a 
2-manifold to a desired K(x) satisfies the hypotheses of Theorem 1.10 (even those 
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of Proposition 1.9) in the special case when k(x) < 0 and K(x) < 0, yielding a 
special case of a result to be proved in § 2, where the assumption that k(a) < 0 
is replaced by x(M/) < 0. In some cases, Theorem 1.10 also applies to equations 
for such conformal factors in higher dimensions. When dim M = n > 3, we alter 
the metric by 


(1.47) gf = ull) g. 


The scalar curvatures o and S of the metrics g and g’ are then related by 


(1.48) C2 awh, Heed 2 ge 


n—2’ n—2’ 


where A is the Laplacian for the metric g. Hence, obtaining the scalar curvature 
S for g’ is equivalent to solving 


(1.49) yAu = o(x)u— S(x)u®, 


for a smooth positive function u. Note that a > 1 and y > 1. For n = 3, we have 
y=8anda=5. 

Note that (1.34) holds, for some a; satisfying 0 < ap < a, < ov, provided 
both o(a) and S(x) are negative on M. Thus we have the next result: 


Proposition 1.11. Let M be a compact manifold of dimension n > 2. Let g be a 
Riemannian metric on M with scalar curvature o. If both o and S are negative 
functions in C®(M), then there exists a conformally equivalent metric g' on M 
with scalar curvature S. 


An important special case of Proposition 1.11 is that if MZ has a metric with 
negative scalar curvature, then that metric can be conformally altered to one with 
constant negative scalar curvature. There is a very significant generalization of 
this result, first stated by H. Yamabe. Namely, for any compact manifold with 
a Riemannian metric g, there is a conformally equivalent metric with constant 
scalar curvature. This result, known as the solution to the Yamabe problem, was 
established by R. Schoen [Sch], following progress by N. Trudinger and T. Aubin. 


Note that (1.3) also holds in the setting of Proposition 1.11; thus to prove this 
latter result, we could appeal as well to Proposition 1.9 as to Theorem 1.10. Here 
is a generalization of (1.49) to which Theorem 1.10 applies in some cases where 
Proposition 1.9 does not: 


(1.50) yAu = B(a)u® + o(a)u— A(x)u®, B<1l<a. 
It is possible that 8 < 0. Then we have (1.34), for some a; > 0, and hence the 


solvability of (1.50), for some positive u € C™(M), provided A(x) and B(x) are 
both negative on M, for any a € C™~(M). If we assume A < 0 on M but only 
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B<0on M™, we still have (1.34), and hence the solvability of (1.50), provided 
o(z) <Oon{xe M: B(x) = 0}. 

An equation of the form (1.50) arises in Chap. 18, in a discussion of results of 
J. York and N. O’Murchadha, describing permissible first and second fundamental 
forms for a compact, spacelike hypersurface of a Ricci-flat spacetime, in the case 
when the mean curvature is a given constant. See (9.28) of Chap. 18. 

The search for stationary solutions to nonlinear wave equations leads to semi- 
linear elliptic PDE on noncompact domains. A device of P.-L. Lions known as 
concentration-compactness is useful for such problems. See [CMMT] for some 
results here, and references to other work. 


Exercises 


1. Assume f(x, w) is smooth and satisfies (1.6). Define F(x, u) and I(u) as in (1.4) and 
(1.5). Show that J has the strict convexity property (1.9) on the space V given by (1.8), 
as long as 


(1.51) 2 t(a,u) > ro, 
where Xo is the smallest eigenvalue of —A on M, with Dirichlet conditions on 0M. 
Extend Proposition 1.2 to cover this case, and deduce that the Dirichlet problem (1.1)- 
(1.2) has a unique solution u € C™(M), for any g € C® (OM), when f (a, wu) satisfies 
these conditions. 

2. Extend Theorem 1.6 to the case where f(x, w) satisfies (1.51) instead of (1.3). 
(Hint: To obtain sup norm estimates, use the variants of the maximum principle indi- 
cated in Exercises 5—7 of § 2, Chap. 5.) 

3. Let spec(—A) = {A;}, where 0 < Ao < Ay < ---. Suppose there is a pair A; << Aj441 
and € > 0 such that 


0 
Aji te S a f(a, u) S —Aj —«, 
for all x, u. Show that, for g € C™°(0M), the boundary problem (1.1)—(1.2) has a 


unique solution u € C™(M). 
(Hint: With pp = (Aj + Aj41)/2, u=v4+g, g € C~(M), rewrite (1.1)-(1.2) as 


(A+ p)u = f(z,u+ 9) +pu—G, 5p =O, 
where G = (A + j1)g, or 
(1.52) v= (A+p) [feu +9) + ur] —9 = Ov). 


Apply the contraction mapping principle.) 
4. In the context of Exercise 3, this time assume 


0 
—Ajpi +E aut eu) < —Aj-1—€, 


so Of /Ou might assume the value —A;. Take « = (Aj—-1 + Aj+1)/2, let Po be the 
orthogonal projection of L?(M) on the d; eigenspace of —A, and let P,) = I — Pp. 
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Writing 
u-g=v=Pov+ Piv=vw +01, 
convert (1.1)—(1.2) to a system 


V1= (A+p)'Pi [f(e, vo Fui+g)4 pv: | Pug, 
(1.53) 
vo = (u— Ay) Po [#(x, v0 +vu1+q) + p1v0] — Pog. 


Given vo, the first equation in (1.53) has a unique solution, v1 = E(vo), by the argument 
in Exercise 3. Thus the solvability of (1.1)—(1.2) is converted to the solvability of 


(1.54) Vo = (u _ Aj) Po [f(2, v0 + =(vo) + 9) + j1v0 | — Pog = W(vo). 


Here, W is a nonlinear operator on a finite-dimensional space. (Essentially, on the real 
line if A; is a simple eigenvalue of —A.) Examine various cases, where there will or 
will not be solutions, perhaps more than one in number. 

5. Given a Riemanian manifold M of dimension n > 3, with metric g and Laplace 
operator A, define the “conformal Laplacian” on functions: 


_ acl a 2 
where o(2) is the scalar curvature of (M,g). If g! = u4/("~?)g as in (1.47), and 
(M, g’) has scalar curvature S'(x), set 
(1.56) Ef =Af —ma'S(@)f, 


where A is the Laplace operator for the metric g’. Show that 
(1.57) L(uf) = u4/- uf. 


(Hint: First show that A(uf) — uu*/("-?) A f = (Au) f. Then use the identity (1.49).) 

6. Assume M is compact and connected. Let Ao be the smallest eigenvalue of —L = 
—A++7,,'o(x). A \o-eigenfunction v of L is nowhere vanishing (by Proposition 2.9 
of Chap. 8). Assume v(x) > 0 on M. Form the new metric g = v*/("~?) g. Show that 
the scalar curvature S of (M, g) is given by 


(1.58) S(x) = rov 4/0"), 
which is positive everywhere if Ao > 0, negative everywhere if Ao < 0, and zero if 
ro = 0. 

7. Establish existence for an ¢ x @ system 


Au= f(x,u), 


where M is a compact Riemannian manifold and f : M x R° — R*° satisfies the 
condition that, for some A < oo, 


jul > A= > f(a,u)-u>0. 
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(Hint: Replace f by +f, and let 0 < 7 < 1. Show that any solution to such a system 
satisfies |u(ax)| < A.) 

8. Let Q be a compact, connected Riemannian manifold with nonempty boundary. 
Consider 


(1.59) Au+ f(z,u) =0, wu an = 9 


for some real-valued u; assume f € C°(Q x R), g € C™(OQ). Assume there is an 
upper solution & and a lower solution u, in C?(Q) N C(Q), satisfying 

Au+ f(z,4) <0, U5. 29, 

Aut f(z,u)>0, Ula <9. 


Also assume u < %@ on 2. 

Under these hypotheses, show that there exists a solution u € C°(Q) to (1.59), such 
thatu<u<U. 

One approach. Let K = {v € C(Q) : u < v < @, which is a closed, bounded, 
convex set in C(Q). Pick \ > 0 so that |0,, f(x, u)| < A, for min u < u < max @. 
Let ®(v) = w be the solution to 


Aw — Aw = —Av — f(z,v), W| 50 = 4g. 


Show that ® : K — K continuously and that ®( 4’) is relatively compact in A’. Deduce 
that ® has a fixed point u € K. 
Second approach. If uo = wand uj+1 = ®(u;), show that 


U=Up SU Ss Suz<-s- <u 


and that u; 7 u, solving (1.59). 


2. Surfaces with negative curvature 


In this section we examine the possibility of imposing a given Gauss curvature 
K(a) < 0 ona compact surface M without boundary, by conformally altering 
a given metric g, whose Gauss curvature is k(x). As noted in § 1, if g and g’ are 
conformally related, 


(2.1) g = "9, 
then K and & are related by 
(2.2) K(x) = e*“(—Au-+ k(2)), 


where A is the Laplace operator for the original metric g, so we want to solve the 
PDE 


(2.3) Au = k(x) — K(x)e™. 
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This is not possible if M is diffeomorphic to the sphere S? or the torus T?, by 
virtue of the Gauss—Bonnet formula (proved in § 5 of Appendix C): 


(2.4) i kdV = / Ke dV = 21x(M), 
M M 


where dV is the area element on M, for the original metric g, and x(M) is the 
Euler characteristic of /. We have 


(2.5) YS?) = 2, o(T*) = 0. 


For us to be able to arrange that K < 0 be the curvature of M, it is necessary 
for x(M) to be negative. This is the only obstruction; following [Bgr], we will 
establish the following. 


Theorem 2.1. Jf M is a compact surface satisfying y(M) < 0, with given 
Riemannian metric g, then for any negative K € C®(M), the equation (2.3) 


has a solution, so M has a metric, conformal to g, with Gauss curvature K(<). 


We will produce the solution to (2.3) as an element where the function 


(2.6) r= i (J laul? + k(2)u) dV 
M 
on the set 
(2.7) S={ue H'(M): [Roe dV = 2rx(M)} 
M 


achieves a minimum. Note that the Gauss—Bonnet formula is built into (2.7), since 
a metric g’ = e?“g has volume element e?“dV. While providing an obstruction 
to specifying K(a), the Gauss—Bonnet formula also provides an aid in making a 
prescription of K'(a) < 0 when it is possible to do so, as we will see below. 


Lemma 2.2. The set S is a nonempty C'-submanifold of H'(M) if K < 0 and 
x(M) < 0. 


Proof. Set 
(2.8) &(u) = e. 
By Trudinger’s inequality, 


(2.9) &: H'(M) —> L?(M), 
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for all p < oo. Take p = 1. We see that © is differentiable at each u € H'(M) 
and 


(2.10) D®(u)v = 2e"v, D&(u): H1(M) > L'(M). 


Furthermore, 


|(D8u) ~ D&(w)) oll ,rcay) $2 f lol “Lee” - "| av 


M 


(2.11) 1/4 i i 
<2 (/ |v|4 wv) (/ lu _ w|* wv) (/ eAlul+4le| wv) 


< Ollollzs - lu — wll exp [C(llullzn + llwllx)] 


so the map ® : H'(M) — L'(M) is a C1-map. Consequently, 


(2.12) J(u) = i dV => J: H'(M) >R isaC'-map. 
M 


Furthermore, DJ(u) = 2K e?“, as an element of H~'(M) ~ L(H'(M),R), 
so DJ(u) # 0 on S. The implicit function theorem then implies that S' is a 
C1-submanifold of H!(M). If K < 0 and x(M) < 0, it is clear that there is a 
constant function in S, so S # 0. 


Lemma 2.3. Suppose F : S — R, defined by (2.6), assumes a minimum atu € S. 
Then u solves the PDE (2.3), provided the hypotheses of Theorem 2.1 hold. 


Proof. Clearly, F : 5 — R is a C'-map. If 7(s) is any C*-curve in S with 
(0) = u, y/(0) = v, we have 


0= < F(ut+ sv)| 9 = [[aw,av) + k(x)v] dV 
M 


= [cau + k(a))v dV. 


M 


(2.13) 


The condition that v is tangent to S' at u is 


(2.14) / Ke2+s") dv = 2nx(M) + O(s”), 
M 


which is equivalent to 
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(2.15) / vKe™ dV =0. 
M 


Thus, if uw € S is a minimum for F’, we have 
v € H'(M), pee dV =0=>> [aut k(x))v dV =0, 
M M 
and hence —Au + k(z) is parallel to Ke?“ in H*(M); that is, 
(2.16) —Au+k(x) = BKe™, 


for some constant 3. Integrating and using the Gauss-Bonnet theorem yield @ = 1 
if y(M) 4 0. 

By Trudinger’s estimate, the right side of (2.16) belongs to L?(M), so u € 
H?(M). This implies e?“ € H?(M), and an easy inductive argument gives u € 
C™@(M). 


Our task is now to show that F has a minimum on S, given kK <0 and 
y(M) <0. Let us write, for any u € H1(M), 


(2.17) UuU=uota, 
where a = (Area M)~' [,,u dV is the mean value of u, and 
(2.18) up € H(M) = {ve H'(M): [ vav =0}. 
M 
Then u belongs to S if and only if 
eo / Ke dV = 2rx(M), 
M 


or equivalently, 

1 2u 
(2.19) a = 5 log [2nx(M) / Ker av]. 
Thus, for wu € S, 


F(u) = J Glaeol 2: keuo dV 
M 
(2.20) 


4+ mx(M) ¢ log 2n|x(M)| — log | | Kem av| 
M 
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Lemma 2.4. If y(M) < Oand K < 0, then infg F(u) = a > —oo. 


Proof. By (2.20), we need to estimate 


—(M) log | i Ker av| 
M 


from below. Indeed, granted that K(x) < —d < 0, 
[Kew dV< -5 f om dV. 
Since e* > 1+ 2, we have [ e?"° dV > [dV + f[ 2uo dV = area M, so 


[Kew dV >-dA (A= AreaM), 


M 
and hence 
(2.21) —x(M) log | [| Kem av| > |x(M)| log [6A] > b > —o0. 
M 
Thus, for u € S, 
1 2 
(2.22) F(u) = 5 [duo + kug } dV — Cz, 
M 


with C2 independent of ug € H!(M). Now, since ||uo|| 2 < C||duoll z2. 


C 
(2.23) if tug av| < Opel|duollz2 + —, 
M 


with C3 and C, independent of ¢. Taking « = 1/2C3, we get F(u) > —C3C4 — 
C2, which proves the lemma. 


We are now in a position to prove the main existence result. 


Proposition 2.5. If M and K are as in Theorem 2.1, then F achieves a minimum 
at a point u € S, which consequently solves (2.3). 


Proof. Pick u, € S so thata+1 > F(un) \ a. If we use (2.22) and (2.23), 
with « = 1/4C3, we have 


i 
(2.24) a+1> jlldunoll ze =(; 
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where Uno = Un— mean value. But the mean value of uy, is 
1 2u 
5 ee [2rx(M) / Kern av] 
M 


which is bounded from above by the proof of Lemma 2.4. Hence 
(2.25) Un is bounded in H'(M). 

Passing to a subsequence, we have an element u € H'(M) such that 
(2.26) Un —>u weakly in H'(M). 


By Proposition 4.3 of Chap. 12, e?%" — e?“ in L1(M), in norm, so u € S. Now 
(2.26) implies that [,, k(x)un dV — J, k(x)u dV and that 


(2.27) / |du|? dV < lim inf / |dun|? dV, 
M M 
so F(u) <a= f, F(v), and the existence proof is completed. 

The most important special case of Theorem 2.1 is the case kK = —1. For any 
compact surface with y(M) < 0, given a Riemannian metric g, it is conformally 
equivalent to a metric for which K = —1. The universal covering surface 
(2.28) M — M, 


endowed with the lifted metric, also has curvature —1. A basic theorem of 
differential geometry is that any two complete, simply connected Riemannian 
manifolds, with the same constant curvature (and the same dimension), are iso- 
metric. See the exercises for dimension 2. For a proof in general, see [ChE]. One 
model surface of curvature —1 is the Poincaré disk, 


(2.29) D={(z,y) ER: a? ty’? <l={zeC: |z| <1}, 
with metric 
(2.30) ds? = A(1 — 2? — y*)~? (da? + dy’). 


This was discussed in § 5 of Chap. 8. Any compact surface M with negative Euler 
characteristic is conformally equivalent to the quotient of D by a discrete group [° 
of isometries. If // is orientable, all the elements of I preserve orientation. 

A group of orientation-preserving isometries of D is provided by the group G 
of linear fractional transformations, where 
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(2.31) ees g= ‘ Ns 


for 
(2.32) g€G=SU(1,1) = {(" a :u,v eC, jul? — |u|? = i} 
VU 


It is easy to see that G acts transitively on D; that is, for any 21, z2 € D, there 
exists g € G such that Tz; = za. We claim {T, : G € G} exhausts the group 
of orientation-preserving isometries of D. In fact, let T be such an isometry of D; 
say T(0) = zo. Pick g € G such that T,z = 0. Then T, o T is an orientation- 
preserving isometry of D, fixing 0, and it is easy to deduce that T,, o T’ must be a 
rotation, which is given by an element of G. 

Since each element of G' defines a holomorphic map of D to itself, we have 
the following result, a major chunk of the uniformization theorem for compact 
Riemann surfaces: 


Proposition 2.6. If M is a compact Riemann surface, x(M) < 0, then there is a 
holomorphic covering map of M by the unit disk D. 


Let us take a brief look at the case y(/Z) = 0. We claim that any metric g on 
such M is conformally equivalent to a flat metric g’, that is, one for which kK = 0. 
Note that the PDE (2.3) is linear in this case; we have 


(2.33) Au = k(2). 


This equation can be solved on © if and only if 


(2.34) fro dV =0, 
M 


which, by the Gauss—Bonnet formula (2.4) holds precisely when y(M/) = 0. In 


this case, the universal covering surface M of MM inherits a flat metric, and it must 
be isometric to Euclidean space. Consequently, in analogy with Proposition 2.6, 
we have the following: 


Proposition 2.7. If M is a compact Riemann surface, y(M) = 0, then M is holo- 
morphically equivalent to the quotient of C by a discrete group of translations. 


By the characterization 
x(M) = dim H°(M) — dim H'(M) + dim H?(M) = 2— dim H'(M), 


if M is a compact, connected Riemann surface, we must have y(M) < 2. If 
x(M) = 2, it follows from the Riemann-Roch theorem that M is conformally 
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equivalent to the standard sphere S? (see § 10 of Chap. 10). This implies the fol- 
lowing. 


Proposition 2.8. If M is a compact Riemannian manifold homeomorphic to S?, 
with Riemannian metric tensor g, then M has a metric tensor, conformal to g, 
with Gauss curvature = 1. 


In other words, we can solve for u € C'°(M/) the equation 
(2.35) Au = k(x) —e™, 


where k(x) is the Gauss curvature of g. This result does not follow from Theorem 
2.1. A PDE proof, involving a nonlinear parabolic equation, is given by [Chow], 
following work of [Ham]. An elliptic PDE proof, under the hypothesis that M/ has 
a metric with Gauss curvature k(x) > 0, has been given in Chap. 2 of [CK]. 

We end this section with a direct Jinear PDE proof of the following, which as 
noted above implies Proposition 2.8. This argument appeared in [MT]. 


Proposition 2.9. If M is a compact Riemannian manifold homeomorphic to S?, 
there is a conformal diffeomorphism F : M — S$? onto the standard Riemann 
sphere. 


Proof. Pick a Riemannian metric on WM, compatible with its conformal structure. 
Then pick p € M, and pick h € D’(M), supported at p, given in local coordinates 
as a first-order derivative of 6, (plus perhaps a multiple of 6,), such that (1,h) = 
0. Hence there exists a solution u € D’(M) to 


(2.36) Au=h. 


Then u € C%°(M \ p), and wu is harmonic on M \ p and has a dist(x, p)~+ type of 
singularity. Now, if M is homeomorphic to $7, then M \ p is simply connected, 
so u has a single-valued harmonic conjugate on M \ p, given by v(a) = ie «du, 
where we pick g € M \ p. We see that v also has a dist(a, p)~! type singularity. 
Then f = u + tv is holomorphic on M \ p and has a simple pole at p. From here 
it is straightforward that f provides a conformal diffeomorphism of 1 onto the 
standard Riemann sphere. 


Actually, the bulk of [MT] dealt with an attack on the curvature equation (2.3), 
with M a planar domain and K = —1, so the equation is 


(2.37) Au=e™ on QCC. 


Here is one of the main results of [MT]. 


Proposition 2.10. Assume Q = C \ S, where S is a closed subset of C with more 
than one point. Then there exists a solution to (2.37) on Q such that e2" (dx?+dy") 
is a complete metric on Q with curvature = —1. 
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As with Proposition 2.6, this has as a corollary the following special case of 


the general uniformization theorem. 


Corollary 2.11. [fQ C C is as in Proposition 2.10, there exists a holomorphic 
covering of Q by the unit disk D. 


Techniques employed in the proof of Proposition 2.10 include maximal princi- 


ple arguments and barrier constructions. We refer to [MT] for further details. 


Exercises 


iF 


Let M be a complete, simply connected 2-manifold, with Gauss curvature K = —1. 
Fix p € M, and consider 


Exp, :R? + T,M — M. 


Show that this is a diffeomorphism. 

(Hint: The map is onto by completeness. Negative curvature implies no Jacobi fields 
vanishing at 0 and another point, so D Exp,, is everywhere nonsingular. Use simple 
connectivity of M to show that Exp,, must be one-to-one.) 

For M as in Exercise 1, take geodesic polar coordinates, so the metric is 


ds? = dr* + G(r, 0) d0”. 
Use formula (3.37) of Appendix C, for the Gauss curvature, to deduce that 
avG =VG 
if K = —1. Show that 
VG(0,0)=0, 0,VG(0,@) = 1, 


and deduce that G(r, @) = y(r) is the unique solution to 


Deduce that 
G(r,0) = sinh? r. 


Using Exercise 2, deduce that any two complete, simply connected 2-manifolds with 
Gauss curvature kK = —1 are isometric. Use (3.37) or (3.41) of Appendix C to show 
that the Poincaré disk (2.30) has this property. 


3. Local solvability of nonlinear elliptic equations 


We take a look at nonlinear PDE, of the form 


(3.1) f(x, D™u) = g(a), 
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where, in the latter argument of f, 
(3.2) D™u = {D°u: |a| < m}. 


We suppose f(x, ¢) is smooth in its arguments, 7 € Q C R", and ¢ = {¢, : 
|a| < m}. The function u might take values in some vector space R*. Set 


(3.3) F(u) = f(z, D™u), 


so F : C@(Q) + C%(Q); F is the nonlinear differential operator. Let uo € 
C™(Q). We say that the linearization of F at up is DF'(uo), which is a linear 
map from C™(Q) to C(Q). (Sometimes less smooth uo can be considered.) We 
have 

of 


) 
_ _ m B 
(3.4) DF(uo)v = oF F(up + sv)|,_5 = Pay Oa (x, D™uo) D°v, 


so DF (uo) is itself a linear differential operator of order m. We say the operator 
F is elliptic at uo if its linearization DF'(u,) is an elliptic, linear differential 
operator. 

An operator of the form (3.3) with 


(3.5) f(a, D™u) = S- Q(t, D™-1u)D%ut filz, D™*u) 


ja|=m 


is said to be quasi-linear. In that case, the linearization at up is 


(3.6) DF(u) = S> da(x,D™~'ug)D%v + Lo, 


|o|=m 


where L is a linear differential operator of order m—1, with coefficients depending 
on D™~1tug. A nonlinear operator that is not quasi-linear is called completely 
nonlinear. The distinction is made because some aspects of the theory of quasi- 
linear operators are simpler than the general case. 

An example of a completely nonlinear operator is the Monge—Ampere operator 


(3.7) F(u) = det (i= >) = Urey — Uys 


Uy Uyy 


with (x,y) € 2 C R?. In this case, 


DF (uo =| (Ye tev) ( My ey) 
(3.8) Vay Vyy —Uny Ura 


= UyyVrr — 2UayVey + Ure Vyy- 
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Thus the linear operator DF'(u) acting on v is elliptic provided the matrix 


(3.9) ( “uy Pug 
—Uny Una 
is either positive-definite or negative-definite. Since, for u real-valued, this is a 


real symmetric matrix, we see that this condition holds precisely when F'(u) > 0. 
More generally, for Q C R”, we consider the Monge—Ampere operator 


(3.7a) F(u) = det H(u), 


where H(u) = (0;0,u) is the Hessian matrix of second-order derivatives. In this 
case, we have 


(3.8a) DF(u)v = Tr [C(u) H(v)], 


where H(v) is the Hessian matrix for v and C(w) is the cofactor matrix of H(w), 
satisfying 
H(u)C(u) = [det H(u)|T. 

In this setting we see that DF'(w) is a linear, second-order differential operator that 
is elliptic provided the matrix C(w) is either positive-definite or negative-definite, 
and this holds provided the Hessian matrix H(u) is either positive-definite or 
negative-definite. 

Having introduced the concepts above, we aim to establish the following local 
solvability result: 


Theorem 3.1. Let g © C™(Q), and let uy, € C™(Q) satisfy 
(3.10) F(u1) = g(x), atx = x0, 


where F'(u) is of the form (3.3). Suppose that F is elliptic at u,. Then, for any ¢, 
there exists u € C*(Q) such that 


(3.11) F(u) =g 


on a neighborhood of xo. 


We begin with a formal power-series construction to arrange that (3.11) hold 
to infinite order at x9. 


Lemma 3.2. Under the hypotheses of Theorem 3.1, there exists uy € C™(Q) 
such that 


(3.12) F (uo) — g(x) = O(|x — xo|°) 
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and 
(3.13) (up — u)(x) = O(|x — x|""*"). 
Proof. Making a change of variable, we can suppose x9 = 0. Denote coordinates 
near 0 in 2 by (x,y) = (@1,...,Un—1, y). We write uo(x, y) as a formal power 
series in y: 
1 k 
(3.14) Up(, Y) = vo(x) + v1 (a)y + ++° + py veley ee 
Set 


(3.15) vo(x) = ur(z,0), vi (x) = Oyur(a,0),...,Um—1(2) = sige ee 0). 
Now the PDE F'(u) = g can be rewritten in the form 


o™ 
(3.16) an = F#(2,y, Du, D™-!Dyu,..., DLD™1u). 


Then the equation for v,,(x) becomes 

(3.17) Um(x) = f# (2,0, D™v9(x),..., Dlum_1(z)). 

Now, by (3.10), we have v(0) = Of’ui(0,0), so (3.13) is satisfied. Taking 
y-derivatives of (3.16) yields inductively the other coefficients v; (a), 7 > m+ 1, 


and the lemma follows from this construction. 


Note that if F’ is elliptic at w;, then Ff’ continues to be elliptic at wo, at least on 
a neighborhood of «9; shrink 2. appropriately. 
To continue the proof of Theorem 3.1, fork > m+ 1+ 7/2, we have that 


(3.18) FP: H®(Q) 3 H*™(Q) 

is a C!-map. We have 

(3.19) L= DF(up) + H*(Q) — H-™ (0). 

Now, CL is an elliptic operator of order m. We know from Chap. 5 that the Dirichlet 
problem is a regular boundary problem for the strongly elliptic operator £L*. 
Furthermore, if 9 is a sufficiently small neighborhood of xo, the map 


C20 LLY: HE*™(Q) 1. HS*(Q) — H-™(Q) 


is invertible. Hence the map (3.19) is surjetive, so we can apply the implicit func- 
tion theorem. For any neighborhood B; of uo in H*(Q), the image of B;, under the 


3. Local solvability of nonlinear elliptic equations 133 


map F' contains a neighborhood C;, of F(uo) in H*~™(Q). Now if (3.12) holds, 
then any neighborhood of r(x) = F(uo) —g in H*—™(Q) contains functions that 
vanish on a neighborhood of 29, so any neighborhood C;, of F'(ug) contains func- 
tions equal to g(x) on a neighborhood of xo. This establishes the local solvability 
asserted in Theorem 3.1. 

One would rather obtain a local solution u € C'°° than just an ¢-fold differen- 
tiable solution. This can be achieved by using elliptic regularity results that will 
be established in the next section. 

We now discuss a refinement of Theorem 3.1. 


Proposition 3.3. [fu ,g € C™(Q) satisfy the hypotheses of Theorem 3.1 at x = 
x9, with F elliptic at uy, then, for any @, there exists u € C*(Q) such that, on a 
neighborhood of Xo, 


(3.21) Flu) =g 
and, furthermore, 
(3.22) (u—u1)(x) = O(|a — ao|"*"). 

In the literature, one frequently sees a result weaker than (3.22). The desir- 
ability of having this refinement was pointed out to the author by R. Bryant. As 
before, results of the next section will give u € C™(Q). 

To begin the proof, we invoke Lemma 3.2, as before, obtaining wo. Now, for 


k>m+1+n/2, set 


Ve = fu € H*(Q) : (u—uo)(x) = O(\a — xol™*)}, 


(3.23) 

Gu—m = {h € H*-™(Q) : h(to) = g(ao)}. 
Then 
(3.24) F: Vp — Gm 


is a C!-map, and we want to show that F' maps a neighborhood of uo in Vz onto 
a neighborhood of go = F'(uo) in G,_m. We will again use the implicit function 
theorem. We want to show that the linear map 


(3.25) L= DF(up): Vv? — Ge_n 
is surjective, where 


v? = {v € H*(Q) : D8 v(x) = 0 for |B| < m}, 


3.26 
oe) G?_, = {he H*-™(Q) : h(xq) = 0} 


are the tangent spaces to V; and G,_m, at uo and go, respectively. 


134 14. Nonlinear Elliptic Equations 


By the previous argument involving (3.19) and (3.20), we know that, for any 
given h € CP os we can find v1 € H*(Q) such that Lv; = h, perhaps after 
shrinking (. To prove the surjectivity in (3.25), we need to find v € H*(Q) such 
that £yv = 0 and such that v — v; = O(|x — xo|'""**), so that vy; — v € VP and 
L(v, — v) = h. We will actually produce v € C%°(Q). To work on this problem, 
we will find it convenient to use the notion of the m-jet Jj”(v) of a function v € 
C™(Q), at xo, being the Taylor polynomial of order m for v about 29. Note that 


(3.27) Je" (v) = JS (v*) > (v — v*) (x) = O(|az — ao|™*), 


given that v,v* © C%(Q). The existence of the function v we seek here is 
guaranteed by the following assertion. 


Lemma 3.4. Given an elliptic operator L of order m, as above, let 


(3.28) J =i deus Lu(eg) = 0} 
and 
(3.29) S={J'(v):v Ee C°(Q), Lu =00nQ}. 


Clearly, S C J. If Q is a sufficiently small neighborhood of xo, then S = J. 


Proof. This result is a simple special case of our goal, Proposition 3.3; the 
beginning of the proof here just retraces arguments from the beginning of that 
proof. Namely, let v;) € C°°(Q) have m-jet in J, hence satisfying Lui (xo) = 0. 
Then Lemma 3.2 applies, so there exists vg such that 


(3.30) Jp’ (vo) = J" (v1) and Lug = O(|x — xo|°°). 


Set ho = Luo. Suppose (2 is shrunk so far that ££* in (3.20) is an isomorphism. 
Now, for any € > 0, there exists hy € C®(Q)) such that 


(3.31) hy =honear xo, |[hill ea) < €- 
Then the Dirichlet problem 
LL*w=h,onQ, we Af"(Q) 
has a unique solution w satisfying estimates 
(3.32) || @|| e+2m (ay < Cellhall wea). 
Fix £ > n/2. By Sobolev’s imbedding theorem, w = L*w satisfies 


(3.33) Ilwllom(ay < CF || wll ze+m @)- 
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In light of this, we have 
(3.34) \lwllom(ay < C#e, Lw =h, on, 


so v = v1 —w defines an element in S, provided (2 is shrunk to 1, on which hy = 
ho in (3.31). Furthermore, Jj”(v) differs from Jj” (v1) by Jj”(w), which is small 
(i.e., proportional to ¢). Since S is a linear subspace of the finite-dimensional 
space /, this approximability yields the identity S = 7 and proves the lemma. 


From the lemma, as we have seen, it follows that the map (3.25) is a surjective 
linear map between two Hilbert spaces, so the implicit function theorem therefore 
applies to the map F' in (3.24). In other words, Ff maps a neighborhood of up in 
VY; onto a neighborhood of gg = F'(ug) in Gym. As in the proof of Theorem 3.1, 
we see that any neighborhood of r(x) = F(uo) — g in G?_,,, contains functions 
that vanish on a neighborhood of 2, so any neighborhood of F'(uo) in Ge—m 
contains functions equal to g(x) on a neighborhood of xo. This completes the 
proof of Proposition 3.3. 

In some geometrical problems, it is useful to extend the notion of ellipticity. 
A differential operator of the form (3.3) is said to be underdetermined elliptic at 
uo provided DF'(uo) has surjective symbol. 


Proposition 3.5. [f F(u1) satisfies F(u,) = g at x = Xo, and if F is underdeter- 
mined elliptic at uy, then, for any &, there exists u € C*(Q) such that F(u) = g 
on a neighborhood of xo and such that (u — u,)(x) = O(|x — ao|"*"). 


Proof. We produce u in the form wu = u; + u2, where we want 
(3.35) F(u, +2) =gnearap, u2(x) = O(|x — xo|""*"). 


We will find uz in the form ug = L*w, where £ = DF‘(u1). Thus we want to 
find w € C+ (Q) satisfying 


(3.36) (w)= F(u,+L*w) =gnearro, w(x) = O(|x — x9|?"*"). 


Now ®(w) is strongly elliptic of order 2m at w; and ®(w ) = 0 at xo if w; = 0. 
Thus the existence of w satisfying (3.36) follows from Proposition 3.3, and the 
proof is finished. 


We will apply the local existence theory to establish the following classical 
local isometric imbedding result. 


Proposition 3.6. Let M be a 2-dimensonal Riemannian manifold. If pp © M and 
the Gauss curvature K (po) > 0, then there is a neighborhood O of po in M that 
can be smoothly isometrically imbedded in R®. 


The proof involves constructing a smooth, real-valued function wu on O such 
that du(po) = 0 and such that g; = g — du? is a flat metric on ©, where g is 
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the given metric tensor on M. Assuming this can be accomplished, then by the 
fundamental property of curvature (Proposition 3.1 of Appendix C), we can take 
coordinates (a, y) on O (after possibly shrinking ©) such that g, = dx? + dy’. 
Thus g = dx? + dy? + du?, which implies that (x,y, uw) : O + R? provides the 
desired local isometric imbedding. 

Thus our task is to find such a function wu. We need a formula for the Gauss 
curvature K, of O, with metric tensor gj = g — du’. A lengthy but finite 
computation from the fundamental formulas given in §3 of Appendix C yields 


(3.37) (1 —|Vul?)"K, = (1—|Vul?)K — det H,(u). 


Here, |Vu|? = g/"u.ju.x~, and H,(w) is the Hessian of u relative to the Levi-Civita 
connection of g: 


(3.38) H,(u) = (wx). 


This is the tensor field of type (1,1) associated to the tensor field V7u of type (0,2), 

such as defined by (2.3)—(2.4) of Appendix C, or equivalently by (3.27) of Chap. 2. 

In normal coordinates centered at p € M, we have H,(u) = (0;0,u), at p. 
Therefore, g; is a flat metric if and only if u satisfies the PDE 


(3.39) det H,(u) = (1 —|Vul?)K. 


By the sort of analysis done in (3.7)-(3.9), we see that this equation is elliptic, 
provided kK > 0 and |Vu| < 1. Thus Proposition 3.3 applies, to yield a local 
solution u € C*(O), for arbitrarily large 2, provided the metric tensor g is smooth. 
As mentioned above, results of § 4 will imply that u€ C(O). 

If K(po) < 0, then (3.39) will be hyperbolic near po, and results of Chap. 16 
will apply, to produce an analogue of Proposition 3.6 in that case. No matter 
what the value of K (po), if the metric tensor g is real analytic, then the nonlinear 
Cauchy—Kowalewsky theorem, proved in § 4 of Chap. 16, will apply, yielding in 
that case a real analytic, local isometric imbedding of M into R?. 

If M is compact (diffeomorphic to S*) and has a metric with K > 0 every- 
where, then in fact M can be globally isometrically imbedded in R®. This result 
is established in [Ni2] and [Po]. Of course, it is not true that a given compact 
Riemannian 2-manifold M can be globally isometrically imbedded in R® (for 
example, if K < 0), but it can always be isometrically imbedded in R% for suf- 
ficiently large N. In fact, this is true no matter what the dimension of M. This 
important result of J. Nash will be proved in § 5 of this chapter. 


Exercises 


1. Given the formula (3.8a) for the linearization of F(u) = det H(u), show that the 
symbol of DF'(u) is given by 
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(3.40) TDF(u)(2,€) = —C(ujé- €. 


2. Let a surface M Cc R®* be given by x3 = u(x1, £2). Given K (a1, x2), to construct u 
such that the Gauss curvature of M at (x1, 2, u(#1, %2)) is equal to K(x, x2) is to 
solve 


(3.41) det H(u) = (1+|Vul?)?K. 


See (4.29) of Appendix C. If one is given a smooth K (21,22) > 0, then this PDE is 
elliptic. Applying Proposition 3.3, what geometrical properties of MM can you prescribe 
at a given point and guarantee a local solution? 

3. Verify (3.37). Compare with formula (**) on p. 210 of [Spi], Vol. 5. 

4. Show that, in local coordinates on a 2-dimensional Riemannian manifold, the left side 
of (3.39) is given by 


det(u’;,) = g~' det(0;0;,u) + A’* (x, Vu) OjOxu + Q(Vu, Vu), 
where g = det(g;x), 


AP* (x, Vu) = 97% oF eu, 


with “+” if 7 =k, “—” if 7 4 k, j’ and k’ the indices complementary to j and k, and 


of, = Ang? + Png 


and 
Q(Vu, Vu) = det(r?,), 77” = 07 ,Oeu. 
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Here we will discuss two methods of establishing regularity of solutions to nonlin- 
ear elliptic PDE. The first is to consider regularity for a linear elliptic differential 
operator of order m 


(4.1) A(z,D)= 5° ag(z) D*, 


|a|<m 


whose coefficients have limited regularity. The second method will involve use 
of paradifferential operators. For both methods, we will make use of the Hélder 
spaces C’*(IR”) and Zygmund spaces C’s(IR”), discussed in § 8 of Chap. 13. Mate- 
rial in this section largely follows the exposition in [T]. 

Let us suppose aq(xz) € C*(R”), s € (0,00) \Z. Then A(z, €) belongs to the 
symbol space C?S7"o, as defined in § 9 of Chap. 13. Recall that p(x, €) € C2S7"5, 
provided 


(4.2) |D2p(z, €)| < Caley” lal 
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and 


(4.3) |Dep(-, €)| C3(R") < Ow (ey lal+ée, 


We would like to establish regularity results for elliptic A(x,€) € C?Sj", by 
pseudodifferential operator techniques. It is not so convenient to work with an 
operator with symbol A(z, €)~!. Rather, we will decompose A(z, €) into a sum 
(44) A(z, €) = A* (zx, £) + Aa, &), 

in such a way that a good parametrix can be constructed for A#(2,D), while 
A®(x, D) is regarded as a remainder term to be estimated. Pick 6 € (0,1). As 
shown in Proposition 9.9 of Chap. 13, any A(x, €) € C?Sj can be written in the 
form (4.4), with 

(4.5) A#(2,€)€ ST, A(x, 6) eCpsTy. 

To A?(a, D) we apply Proposition 9.10 of Chap. 13, which, we recall, states that 
(4.6) p(z,€) € CZSt's => p(z, D): cer _.¢cT, -d-d)s<r<s. 
Consequently, 


(4.7) PD rr aC, Sl eres, 


Now let p(a, D) € OPS, %" be a parametrix for A*(x, D), which is elliptic. 
Hence, mod C, 


(4.8) p(x, D) A(x, D)u = u+ p(x, D) A(x, D)u, 
so if 
(4.9) A(x, D)u = f, 


then, mod C°, 

(4.10) u = p(a, D) f — p(x, D) A(z, D)u. 
In view of (4.7), we see that when (4.10) is satisfied, 

(4.11) meorr-s fect = wer, 


We then have the following. 
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Proposition 4.1. Let A(a,€) € C2 Sj" be elliptic, and suppose u solves (4.9). 
Assuming 


(4.12) s>0, 0<d6<1 and -(1-d)s<r<s, 
we have 
(4.13) regen, feCi as 4eC", 


Note that, for Ja] =m, D°u € C’~°*s, and r—ds could be negative. However, 
Ao(x) Du will still be well defined for ag € C’®. Indeed, if (4.6) is applied to the 
special case of a multiplication operator, we have 


(4.14) aeC®, we Cl = aveC?, for-—s<o<s. 


Note that the range of r in (4.12) can be rewritten as —s < r — 0s < (1—0)s. If 
we set r —0s = —s +e, this means 0 < € < (2—4)s, so we can rewrite (4.13) as 


(4.15) ueCc™ te fecti=—uec™, providede >0, r<s, 


as long as the relation r = —(1 — 6)s + € holds. Letting 6 range over (0,1), 
we see that this will hold for any r € (—s + €,€). However, if r € [e,s), we 
can first obtain from the hypothesis (4.15) that u € Cr? for any p < €. This 
improves the a priori regularity of u by almost s units. This argument can be 
iterated repeatedly, to yield: 


Theorem 4.2. If A(x,€) € C* Si" is elliptic and u solves (4.9), then (assuming 
s >0) 


mec” feCc —=uec", 


(4.16) ; 
provided e>0 and —s<r<-s. 


We can sharpen this up to obtain the following Schauder regularity result: 


Theorem 4.3. Under the hypotheses above, 
(4.17) ee, fe vec, 


Proof. Applying (4.16), we can assume u € Ct" with s — r > 0 arbitrarily 
small. Now if we invoke Proposition 9.7 of Chap. 13, which says 


(4.18) p(x, €) € C™ST, => p(a, D): Certs CT, 
for all ¢ > 0, we can supplement 4.7 with 


(4.19) A’ D) vO rere CO, eS 0. 
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If 6 > 0, andif¢ > 0 is picked small enough, we can write m+s—ds+e =m+r 
with r < s, so, under the hypotheses of (4.17), the right side of (4.8) belongs to 
C*S, proving the theorem. We note that a similar argument also produces the 
regularity result: 


(4.20) ye Here? fec*—uwec™, 


We now apply these results to solutions to the quasi-linear elliptic PDE 


(4.21) Qo,(z,D™-1u) D*u = f. 


|a|<m 
As long as u € C™—-1*8, ag(xz,D™-1u) € C%. If also u € C™§**, we 
obtain (4.16) and (4.17). If r > s, using the conclusion u € C’”*S, we obtain 
dq(x,D™~'u) € Ost, so we can reapply (4.16) and (4.17) for further regular- 
ity, obtaining the following: 
Theorem 4.4. [fu solves the quasi-linear elliptic PDE (4.21), then 


(4.22) weCo™ Ac =, fect = vec”, 


provided s > 0, € > 0, and —s < r. Thus 


(4.23) eect FeC —S—uEer*, 
provided 

1 
(4.24) a5) r>s-l. 


We can sharpen Theorem 4.4 a bit as follows. Replace the hypothesis in 
(4.22) by 


(4.25) geC? Ann, 


with p € (1,00). Recall that Proposition 9.10 of Chap. 13 gives both (4.6) and, 
for p € (1,00), 


p(x, €) € C2ST, => p(x, D): Ht? — A”, 


(4.26) 
-(l-dO)s<r<s. 


Parallel to (4.14), we have 


(4.27) aéC®, ue H°? = aue H’?, for -—-s<a<s, 
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as a consequence of (4.26), so we see that the left side of (4.21) is well defined 
provided s + 0 > 1. We have (4.8) and, by (4.26), 


(4.28) Ae De ror? FP. for (1 oie eS 8, 
parallel to (4.7). Thus, if (4.25) holds, we obtain 

(4.29) p(z, D)A®(2, D)u € H™-1tet eer | 

provided —(1 — 5)s < 6s —1+0 < 5, i.e., provided 

(4.30) s+o>1and —l+o+0s<-s. 

Thus, if f € H?? with p > o — 1, we manage to improve the regularity of u 


over the hypothesized (4.25). One way to record this gain is to use the Sobolev 
imbedding theorem: 
ésp 


(431) Hm Metis cpm itm, py = Ph _ 5 (14?) 
n—06s n 


If we assume f € CY with r > o — 1, we can iterate this argument sufficiently 
often to obtain u € C™—1+°~-§¢, for arbitrary ¢ > 0. Now we can arrange s++o > 
1+, so we are now in a position to apply Theorem 4.4. This proves the following: 


Theorem 4.5. [fu solves the quasi-linear elliptic PDE (4.21), then 
(4.32) ueC™ Ven Am lop feci=uec™, 
provided 1 < p< cand 
(4.33) s>0, sto>1, r>oa-1. 
Note that if u € H™” for some p > n, thenu € C™—1+ for s = 1—n/p > 0, 
and then (4.32) applies, with o = 1, or even witho = n/p +e. 


We next obtain a result regarding the regularity of solutions to a completely 
nonlinear elliptic system 


(4.34) F(a, D™u) = f. 
We could apply Theorems 4.2 and 4.3 to the equation for u; = Ou/Oz;: 


Of _ 


OF m a m 
(4.35) > 5—(a, D™u)D°u; = -F,,(x,D aa 


fj. 
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Suppose u € C™*S, s > 0, so the coefficients ag (xz) = (OF /0Cq)(x, D™u) € 
C*. If f € CY, then f; € C* + C?~'. We can apply Theorems 4.2 and 4.3 
to u; provided u € C™+1~S*®, to conclude that u € C™tst! U C”*", This 
implication can be iterated as long as s + 1 <r, until we obtain u € C?”*", 

This argument has the drawback of requiring too much regularity of uw, namely 
that u € O™+1~s+ as well as u € C™*S. We can fix this up by considering 
difference quotients rather than derivatives 0;u. Thus, for y € R”, |y| small, set 


vy() = |y|* [ule + y) — u(2))]; 


Vy Satisfies the PDE 


(4.36) S Ga(2) Dy, 2) = Ga), 


|ox|<m 


where 
(4.37) ®g,(x) = | (OF /OCq)(x,tD™ u(x) + (1—t)D™u(x + y)) dt 


and G, is an appropriate analogue of the right side of (4.35). Thus ®,, is in C®, 
uniformly as |y| + 0, if u € C™**, while this hypothesis also gives a uniform 
bound on the C”~!*S-norm of vy. Now, for each y, Theorems 4.2 and 4.3 apply 
to vy, and one can get an estimate on ||vy||om+e,p = min(s,r — 1), uniform as 
|y| — 0. Therefore, we have the following. 


Theorem 4.6. [fu solves the elliptic PDE (4.34), then 


(4.38) “ueCc™s, fectT=suecr™, 
provided 
(4.39) O<s<r. 


We shall now give a second approach to regularity results for nonlinear elliptic 
PDE, making use of the paradifferential operator calculus developed in § 10 of 
Chap. 13. In addition to giving another perspective on interior estimates, this will 
also serve as a warm-up for the work on boundary estimates in § 8. 

If F' is smooth in its arguments, then, as shown in (10.53)—(10.55) of Chap. 13, 


(4.40) F(x,D™u) = S> M,(«,D)D°ut F(x, D"Wo(D)u), 


Ja|<m 


where F(x, D™Wo(D)u) € C™ and 
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(4.41) = mae) )wn4i(8), 

with 
Oye 

(4.42) me(e) = | ag, (Ee(D)D™u + tes1(D)D™u) dt. 
0 a 


As shown in Proposition 10.7 of Chap. 13, we have, for r > 0, 
(4.43) weCc™r => Ma (a, €) € ABST, C S210 CSP, 


We recall from (10.31) of Chap. 13 that 
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(4.44) p(x,£) € ApST; <=> ||D#n(-, O|lorts < Cas (EY™ IN, 5 > 0. 


Consequently, if we set 


(4.45) 1(u;2,D)= S~ M(x, D)D* 


|a|<m 
we obtain 
Proposition 4.7. Ifu <¢ C™t", r > 0, then 
(4.46) F(a, D™u) = M(u;2, D)ut+ R, 
with R € C'™ and 
(4.47) M(u; 2, €) € AgSt C ST ACS 


Decomposing each M,(a, €), we have, by (10.60)-(10.61) of Chap. 13, 


(4.48) M(u; 2,6 = M* (2,6 + M(a,6), 
with 

(4.49) M* (a, €) € AGST C ST 

and 

(4.50) M*(a,£) € CP ste 1 AST, c ST”. 


Let us explicitly recall that (4.49) implies 
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D3M* (a, €) € Sis, IB| <r, 


(4.51) 
rma || >r. 


Note that the linearization of F(x, D™w) at u is given by 


(4.52) lv= > M,(2)D%, 
|a|<m 
where 
~ OF -_ 
(4.53) M(x) = ac, P u). 


Comparison with (4.40)-(4.42) gives (for u € C™*") 

(4.54) M(u;2,€) — L(z,€) e CT Syy", 

by the same analysis as in the proof of the 6 = 1 case of (9.35) of Chap. 13. More 
generally, the difference in (4.54) belongs to C’S7"5- re 0 <6 <1. Thus L(z, €) 
and M(u; x, €) have many qualitative properties in common. 

Consequently, given u € C™*+", the operator M*#(x,D) € OPS%"5 is 
microlocally elliptic in any direction (29, 9) € T*R” \ 0 that is noncharacteristic 
for F(x, D™ wu), which by definition means noncharacteristic for L. In particular, 
M# (a, D) is elliptic if F(x, D™u) is. Now if 
(4.55) F(z, D™u) = f 
is elliptic and Q € OPS," is a parametrix for M* (a, D), we have 
(4.56) u=Q(f—M®(z,D)u), mod C@. 

By (4.50) we have 

(4.57) OM @D) snr — eee ge i, 

(In fact s > —(1 — 5)r suffices.) We deduce that 

(4.58) uc H™-etsp fe HP —s ye HHS? 

granted r > 0, s > 0, and p © (1,00). There is a similar implication, with 
Sobolev spaces replaced by Hélder (or Zygmund) spaces. This sort of implication 
can be iterated, leading to a second proof of Theorem 4.6. We restate the result, 


including Sobolev estimates, which could also have been obtained by the first 
method used to prove Theorem 4.6. 
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Theorem 4.8. Suppose, given r > 0, thatu € C™*" satisfies (4.55) and this 
PDE is elliptic. Then, for each s > 0, p € (1,00), 


(4.59) fe H*? ue H™*? and fect =—uec™s., 
By way of further comparison with the methods used earlier in this section, 


we now rederive Theorem 4.5, on regularity for solutions to a quasi-linear elliptic 
PDE. Note that, in the quasi-linear case, 


(4.60) FED") = 5 ae DS 7, 


lal<m 


the construction above gives F(z,D™u) = M(u;x,D)u + Ro(u) with the 
property that, for r > 0, 


(4.61) weO™'™ => M(u;2,6) € OT ST ST, + C°STs NST. 
Of more interest to us now is that, for0 < r < 1, 

(4.62) uEeConr tT —> M(u;z,8) € CST ASTM + ST", 

which follows from (10.23) of Chap. 13. Thus we can decompose the term in 


CT S79NS7", via symbol smoothing, as in (10.60)—(10.61) of Chap. 13, and throw 
the term in Sj"; " into the remainder, to get 


(4.63) M(u;a,€) = M*(a,é) + M°(a, 6), 
with 
(4.64) M*#(a,€)€ ST, M(2,6)€ 97". 


If P(x, D) € OPS," is a parametrix for the elliptic operator M* (x, D), then 
whenever u € C™—!+" 4 H™—1!+?-P is a solution to (4.60), we have, mod C®™, 


(4.65) u = P(x, D)f — P(a, D)M°(a, D)u. 
Now 
(4.66) P(a,D)M°(x,D):H™- +9? —, H™-letrop fp + p> 1, 


by the last part of (4.64). As long as this holds, we can iterate this argument and 
obtain Theorem 4.5, with a shorter proof than the one given before. 

Next we look at one example of a quasi-linear elliptic system in divergence 
form, with a couple of special features. One is that we will be able to assume less 
regularity a priori on u than in results above. The other is that the lower-order 
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terms have a more significant impact on the analysis than above. After analyzing 

the following system, we will show how it arises in the study of the Ricci tensor. 
We consider second-order elliptic systems of the form 

(4.67) S- 0;4;%(@,u)O,u+ B(x, u, Vu) = f. 

We assume that a;x(x,u) and B(x, u, p) are smooth in their arguments and that 

(4.68) |B(x,u,p)| < Cp)”. 

Proposition 4.9. Assume that a solution u to (4.67) satisfies 

(4.69) Vue L!, forsomeq>n, henceu € C", 

for some r € (0,1). Then, if p € (q,00) and s > —1, we have 

(4.70) fe H®? ue Hs??, 


To begin the proof of Proposition 4.9, we write 


(4.71) S > ajn(x, u) Onu = Aj(u;z, D)u 
k 


mod C'®, with 
(4.72) ue CT => Aj(ujz,é) € CST NST, + S17", 
as established in Chap. 13. Hence, given 6 € (0,1), 


Aj(u;@,€) = A¥ (a, €) + Ab(z, 8), 


(4.73) 
A® (a, €) € Sts A®(a, €) € Cheer 


It follows that we can write 


(4.74) > 045% (2, U) Ogu = P¥u+ Pou, 
with 

(4.75) pP# = 5° 0;A*(«,D) € OPS}, elliptic, 
and 


(4.76) P=)" 0;A"(q, D). 
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By Theorem 9.1 of Chap. 13, we have 
(4.77) A®(x,D): Hit? _y HP’ for u>0, 1 <p! < ov. 


In particular (taking pp = rd, p’ = q), 


(4.78) Vue LY => Plhue H-14784, 
Now, if 
(4.79) EY € OPS; 


denotes a parametrix of P#, we have, mod C™, 
(4.80) u = E* f — E* B(a,u, Vu) — E* Pou, 


and we see that under the hypothesis (4.69), we have some control over the last 
term: 


(4.81) E# Pou © H1+784 Cc HN z pens 
q q n 

Note also that under our hypothesis on B(x, u, p), 

(4.82) Vu € LY => B(a,u, Vu) € LY?, 

Now, by Sobolev’s imbedding theorem, 

(4.83) E* B(x,u, Vu) € Ht, 


with p = q/(2 — q/n) if q < 2n and for all p < cif q > 2n. Note that 
p> q(1+a/n) if g = n+4a. This treats the middle term on the right side of 
(4.80). Of course, the hypothesis on f yields 

(4.84) B¥ fe H**?, 542>1, 


which is just where we want to place wu. 
Having thus analyzed the three terms on the right side of (4.80), we have 


# eee 
(4.85) uc Ht, g# =min(p,p,q). 
Iterating this argument a finite number of times, we get 


(4.86) u€ H)?. 
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If s = —1 in (4.70), our work is done. 

If s > —1 in (4.70), we proceed as follows. We already have u € H'?, so 
Vu € L?. Thus, on the next pass through estimates of the form (4.78)-(4.83), we 
obtain 


E# Phy ¢ Hitrée 


(4.87) 
E* B(x, u, Vu) € H??/? cH?" Pe, 
and hence 
(4.88) ue Hite?) g = min (vs tos ik :) 
Pp 


We can iterate this sort of argument a finite number of times until the conclusion 
in (4.70) is reached. 

Further results on elliptic systems of the form (4.67) will be given in § 12B. 
We now apply Proposition 4.9 to estimates involving the Ricci tensor. Consider a 
Riemannian metric gj, defined on the unit ball By C IR”. We will work under the 
following hypotheses: 

(i) For some constants a; € (0,00), there are estimates 


(ii) The coordinates x1,...,%y, are harmonic, namely 
(4.90) Age = 0. 


Here, A is the Laplace operator determined by the metric g,;,. In general, 
(4.91) Av = p*Oj;Onv —NMAv, N= gl 5K. 


Note that Az, = —X°, so the coordinates are harmonic if and only if NM = 0. 
Thus, in harmonic coordinates, 


(4.92) Av = g)* 0;Oxv. 


We will also assume some bounds on the Ricci tensor, and we desire to see 
how this influences the regularity of g;,, in these coordinates. Generally, as can be 
derived from formulas in § 3 of Appendix C, the Ricci tensor is given by 


i 
Ric jz, = ao [—PeOm 95K = Oj OK Gem 
(4.93) + OOmGe; + 80; 9m] + Mjx(9, V9) 


1 1 1 
= = 59°" Omg ik a 5 950K + 5 9IRCOjN + Ayx(g, V9), 
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with \° as in (4.91). In harmonic coordinates, we obtain 


1 : 
Se 5 > 459" (2) Okgem + Qem(g.V9) = Ricem, 


and Qem(g, Vg) is a quadratic form in Vg, with coefficients that are smooth func- 
tions of g, as long as (4.89) holds. Also, when (4.89) holds, the equation (4.94) is 
elliptic, of the form (4.67). Thus Proposition 4.9 implies the following. 


Proposition 4.10. Assume the metric tensor satisfies hypotheses (i) and (ii). Also 
assume that, on By, 


(4.95) Vajr € L', forsomeq>n, 
and 
(4.96) Ricem € H*?, 


for some p € (q,00), 8 => —1. Then, on the ball Bg/10, 


(4.97) Ojk © Het?” 


In [DK] it was shown that if gj, € C?, in harmonic coordinates, then, for 
k € Zt, a € (0,1), Ricem € C*t* => gj, € C*t?*, Such results also 
follow by the methods used to prove Proposition 4.10. See Proposition 12B.2 fora 
stronger result. A variant of Proposition 4.10, using Morrey spaces, is established 
in [T2]. Another variant of (4.96) = (4.97), 


(4.98) Ricem € L© => V7gj~ € BMO, 


is established in §3.10 of [T3], and further consequences are pursued in §3.11 of 
that work. 


Exercises 
1. Consider the system F'(x,D™u) = f when 


F(¢,D™u)= >> aa(z, D’u) D*u, 


ja|<m 


for some j such that0 < j < m. Assume this quasi-linear system is elliptic. Given 
p,q € (1,00), r > 0, assume 


Uwe Crom |e? pt p>, 


Show that 
jen en", 
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5. Isometric imbedding of Riemannian manifolds 


In this section we will establish the following result. 


Theorem 5.1. /f M is a compact Riemannian manifold, there exists a C'°-map 
(5.1) &:M—>R*, 
which is an isometric imbedding. 


This was first proved by J. Nash [Nal], but the proof was vastly simplified by 
M. Giinther [Gul ]-[Gu3]. These works also deal with noncompact Riemannian 
manifolds and derive good bounds for N, but to keep the exposition simple we 
will not cover these results. 

To prove Theorem 5.1, we can suppose without loss of generality that M is 
a torus T*. In fact, imbed M smoothly in some Euclidean space R*; M will sit 
inside some box; identify opposite faces to have M Cc T*. Then smoothly extend 
the Riemannian metric on M to one on TY. 

If R denotes the set of smooth Riemannian metrics on T* and € is the set of 
such metrics arising from smooth imbeddings of T” into some Euclidean space, 
our goal is to prove 


(5.2) E=R. 
Now 7 is clearly an open convex cone in the Fréchet space 
V=C™(' 57") 


of smooth, second-order, symmetric, covariant tensor fields. As a preliminary to 
demonstrating (5.2), we show that the subset € shares some of these properties. 


Lemma 5.2. € is a convex cone in V. 


Proof. If go € €, it is obvious from scaling the imbedding producing go that 
ago € €, for any a € (0,00). Suppose also that g; € €. If these metrics g; 
arise from imbeddings :; : T* + R”/, then go + gi is a metric arising from the 
imbedding yo © v1 : TX + R”°+”:. This proves the lemma. 


Using Lemma 5.2 plus some functional analysis, we will proceed to establish 
that any Riemannian metric on T* can be approximated by one in €. First, we 
define some more useful objects. If u : T* — R™ is any smooth map, let 7, 
denote the symmetric tensor field on T” obtained by pulling back the Euclidean 
metric on R™. Ina natural local coordinate system on T* = R*/Z*, arising from 
standard coordinates (21,...,2,) on R*, 


(5.3) => ee oe dx; ® day. 
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Whenever wu is an immersion, 7, is a Riemannian metric; and if u is an imbedding, 
then +, is of course an element of €. Denote by C the set of tensor fields on T* of 
the form +,,. By the same reasoning as in Lemma 5.2, C is a convex cone in V. 


Lemma 5.3. € is a dense subset of R. 


Proof. If not, take g € R such that g ¢ €, the closure of € in V. Now € isa 
closed, convex subset of V, so the Hahn—Banach theorem implies that there is a 
continuous linear functional £ : V — R such that ¢(€) < 0 while £(g) =a > 0. 

Let us note that C C € (and hence C = &). In fact, if u : T* > R™ is any 
smooth map and y : T* — R” is an imbedding, then, for any € > 0, ep @u: 
T* + R"*™ is an imbedding, and yeyqu = €?%y~ + Yu € E. Taking € \, 0, we 
have 7, € €. 

Consequently, the linear functional £ produced above has the property 
£(C) < 0. Now we can represent £ as a k x k symmetric matrix of distribu- 
tions ¢;; on T*, and we deduce that 


(5.4) S (Of O;f,45) $0, VF eC°(T*). 

ag 
If we apply a Friedrichs mollifier J-, in the form of a convolution opera- 
tor on T*, it follows easily that (5.4) holds with ¢;; € D’(T*) replaced by 
riz = Jeliz € C(T*). Now it is an exercise to show that if A;; € C°(T*) 
satisfies both A;; = A;, and the analogue of (5.4), then A = (A,;) is a negative- 


semidefinite, matrix-valued function on T*, and hence, for any positive-definite 
G= (gi;) E Curr s-T*), 


(5.5) do (gi5s Aig) S 0. 
Taking \;; = J-¢;; and passing to the limit ¢ — 0, we have 
(5.6) S"(9i3, £23) < 0, 

ag 


for any Riemannian metric tensor (g;;) on T*. This contradicts the hypothesis 
that we can take g ¢ €, so Lemma 5.3 is proved. 


The following result, to the effect that € has nonempty interior, is the analytical 
heart of the proof of Theorem 5.1. 


Lemma 5.4. There exist a Riemannian metric go € E and a neighborhood U of 
0 in V such that gp + h € E whenever h € U. 


We now prove (5.2), hence Theorem 5.1, granted this result. Let g € R, and 
take go € E, given by Lemma 5.4. Then set gi = g + a(g — go), where a > 0 
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is picked sufficiently small that g; € 7. It follows that g is a convex combination 
of go and gj; that is, g = ago + (1 — a)gi for some a € (0,1). By Lemma 5.4, 
we have an open set U C V such that g) + h € € whenever h € U. But by 
Lemma 5.3, there exists h € U such that g, — bh € E, b = a/(1 — a). Thus 
g = a(go +h) + (1 — a)(gi — bh) is a convex combination of elements of E, so 
by Lemma 5.1, g € €, as desired. 

We turn now to a proof of Lemma 5.4. The metric go will be one arising from 
a free imbedding 


(5.7) u:T® — R4, 


defined as follows. 


Definition. An imbedding as in (5.7) is free provided that the k + k(k + 1)/2 
vectors 


(5.8) O;u(x), Oj;Onu(x) 


are linearly independent in R", for each x € T*. 


Here, we regard T* =R* / Z*, sou: R* — R*, invariant under the translation 
action of Z* on R*, and (21,...a) are the standard coordinates on R*. It is not 
hard to establish the existence of free imbeddings; see the exercises. 

Now, given that u is a free imbedding and that (h;;) is a smooth, symmetric 
tensor field that is small in some norm (stronger than the C?-norm), we want to 
find v € C%(T*, R“), small in a norm at least as strong as the Ct-norm, such 
that, with go = Ju; 


(5.9) x O;(ue + ve)O; (ue + Ve) = Goig + hig, 
e 
or equivalently, using the dot product on R“, 


(5.10) Oyu: Oju + Oju- jv + Gv + djvu = his. 


We want to solve for v. Now, such a system turns out to be highly underdeter- 
mined, and the key to success is to append convenient side conditions. Following 
[Gu3], we apply A — 1 to (5.10), where A = $> 0?, obtaining 


a{(d ~1)(dju-v) + Av- ajv} i a,{(A ~1)(0,u-v) + Av- aww 
(5.11) —2 {ia _— 1)(0;0;u sv) + save . Ov — O;Oev - 0; Oev 


where we sum over ¢. Thus (5.10) will hold whenever v satisfies the new system 
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(5.12) 
(A —1)(G(a)-v) = — Av- jv, 
(A ~1)(Gj(a)-¥) = — F(A—-Dhy 


+ (2:0. : O;Oev —Av- O,0;0 = sav : aye) ; 


Here we have set ¢;(x) = Oju(a), ¢i;(x) = 0,0; u(x), smooth R“-valued func- 
tions on T*. 

Now (5.12) is a system of &(k +3) /2 = « equations in ys unknowns, and it has 
the form 


(5.13) (A —1)(€(x)v) + Q(D?0, D’v) = H = (0 -3(a = 1)h) . 


where (2) : R* — R’* is surjective for each x, by the linear independence 
hypothesis on (5.8), and Q is a bilinear function of its arguments D?v = {D%v : 
|a| < 2}. This is hence an underdetermined system for v. We can obtain a deter- 
mined system for a function w on T* with values in R", by setting 


(5.14) v = €(x)'w, 
namely 
(5.15) (A — 1)(A(z)w) + Q(D?u, D?w) = H, 


where, for each x € T*, 
(5.16) A(x) = €(x)€(x)’ € End(R*) is invertible. 


If we denote the left side of (5.15) by F'(w), the operator F’ is a nonlinear differ- 
ential operator of order 2, and we have 


(5.17) DF(w)f = (A —1)(A(2)f) + B(D?w, D’f), 
where B is a bilinear function of its arguments. In particular, 
(5.18) DF(0)f =(A-— 1)(A(a)f). 


We thus see that, for any r € Rt \ Z*, 


(5.19) DF(0) : C’*?(T*,R®) —+ C"(T*, R*) is invertible. 


Consequently, if we fix r € R* \ Z*, and if H € C"(T*,R*) has sufficiently 
small norm (ice., if (hij) € C’*?(T*, S27") has sufficiently small norm), then 
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(5.15) has a unique solution w € O"+?(T*, R”) with small norm, and via (5.14) 
we get a solution v € OT? Cr, R*“), with small norm, to (5.13). If the norm of v 
is small enough, then of course u + v is also an imbedding. 

Furthermore, if the C’*?-norm of w is small enough, then (5.15) is an elliptic 
system for w. By the regularity result of Theorem 4.6, we can deduce that w is 
C@ (hence v is C'°°) if h is C°°. This concludes the proof of Lemma 5.4, hence 
of Nash’s imbedding theorem. 


Exercises 


In Exercises 1-3, let B be the unit ball in R*, centered at 0. Let (Ai;) be a smooth, 
symmetric, matrix-valued function on B such that 


(5.20) > [ane (0; f)(a) Auj(a) dx <0, Wf € C§°(B). 


1. Taking f- € Co°(B) of the form 
fiej=fe ae “£),, 0<e<1, 


examine the behavior as € \, 0 of (5.20), with f replaced by f-. Establish that A11(0) < 
0. 

2. Show that the condition (5.20) is invariant under rotations of R*, and deduce that 

(Ai; (0)) is a negative-semidefinite matrix. 

Deduce that (A,;;(x)) is negative-semidefinite for all x € B. 

4. Using the results above, demonstrate the implication (5.4) = (5.5), used in the proof of 
Lemma 5.3. 

5. Suppose we have a C®°-imbedding y : T* — R”. Define a map 


bee 


b:T* RR’ SS°R" sR", want 5n(n +1), 
to have components 
yi(z), ls jn, pilx)pi(x), 1<icjen. 


Show that ¢ is a free imbedding. 

6. Using Leibniz’ rule to expand derivatives of products, verify that (5.10) and (5.11) are 
equivalent, for v € C®(T*, R“). 

7. In [Nal] the system (5.10) was augmented with 0;u-v = 0, yielding, instead of (5.12), 
the system 


(5.21) i 


What makes this system more difficult to solve than (5.12)? 
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6. Minimal surfaces 


A minimal surface is one that is critical for the area functional. To begin, we 
consider a k-dimensional manifold M (generally with boundary) in R”. Let € be 
a compactly supported normal field to /, and consider the one-parameter family 
of manifolds 1/7, C R”, images of M/ under the maps 


(6.1) ps(t) =a24+ s(x), cEeM. 


We want a formula for the derivative of the k-dimensional area of M,, at s = 0. 
Let us suppose € is supported on a single coordinate chart, and write 


(6.2) As) = [|X A--A aX du; ++ dup, 
2 


where 2 C R* parameterizes M, by X(s,u) = Xo(u) + s€(u). We can also 


suppose this chart is chosen so that ||0; Xo A --- A O,Xo|| = 1. Then we have 
(6.3) 
A"(0) = 


k 
Do | (Xo Ae NB}EN + AAeX0,H1Xo A+++ A DX) du, --- dug. 
j=l 


By the Weingarten formula (see (4.9) of Appendix C), we can replace 0;€ by 
—AgE;, where FE; = 0; Xo. Without loss of generality, for any fixed  € M, we 
can assume that /,..., E, is an orthonormal basis of T),./. Then 


(6.4) (BE, A+++ A AgE; A+++ A Ep, By A+++ A Eg) = (AcE;, £5), 


at x. Summing over j yields Tr A¢(x), which is invariantly defined, so we have 


(6.5) AO) == / Tr A(x) dA(z), 
M 


where A¢(x) € End(T,M) is the Weingarten map of M and dA(z) the Rie- 
mannian k-dimensional area element. We say / is a minimal submanifold of R” 
provided A’(0) = 0 for all variations of the form (6.1), for which the normal field 
€ vanishes on OM. 

If we specialize to the case where / = n—1 and M is an oriented hypersurface 
of R”, letting N be the “outward” unit normal to /, for a variation M, of M 
given by 


(6.6) ys(t) =a+sf(r)N(a), «eM, 


we hence have 
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(6.7) Ais / Tr An (2) f(x) dA(z). 


M 


The criterion for a hypersurface M of R” to be minimal is hence that Tr An = 0 
on M. 

Recall from § 4 of Appendix C that Aj (a) is a symmetric operator on T;, M. 
Its eigenvalues, which are all real, are called the principal curvatures of M/ at x. 
Various symmetric polynomials in these principal curvatures furnish quantities of 
interest. The mean curvature H(x) of M at x is defined to be the mean value of 
these principal curvatures, that is, 


(6.8) A(x) = - Tr Ay (a). 


Thus a hypersurface 1Z C R” is a minimal submanifold of R” precisely when 
H1=0O0onM. 

Note that changing the sign of N changes the sign of Ay, hence of H. Under 
such a sign change, the mean curvature vector 


(6.9) (x) = H(x)N(z) 
is invariant. In particular, this is well defined whether or not / is orientable, and 
its vanishing is the condition for M to be a minimal submanifold. There is the 


following useful formula for the mean curvature of a hypersurface M/ Cc R”. Let 
X : M ~— R" be the isometric imbedding. We claim that 


1 
(6.10) H(x) = Zan: 
with k = n—1, where A is the Laplace operator on the Riemannian manifold M/, 
acting componentwise on X. This is easy to see at a point p € M if we translate 
and rotate IR” to make p = 0 and represent MV as the image of R* = R”~! under 
(6.11) FO)AHa@ Fey OH Gao), VIO HO 
Then one verifies that 
AX(p) = O7Y (0) +--+ + OZY (0) = (0,...,0,07 f(0) +--+ OF F(0)), 

and (6.10) follows from the formula 

k 
(6.12) (An(0)X,Y) = $© 0,0; f(0) Xi¥; 

i,j=l 


for the second fundamental form of J at p, derived in (4.19) of Appendix C. 
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More generally, if MM C R” has dimension k < n — 1, we can define the mean 
curvature vector (x) by 


(6.13) (9(x),£) = - Tr Ae(x), (2) L T,M, 


so the criterion for (MZ to be a minimal submanifold is that § = 0. Further- 
more, (6.10) continues to hold. This can be seen by the same type of argument 
used above; represent MV as the image of R* under (6.11), where now f(x’) = 


(Wp+1,--+-;Ln). Then (6.12) generalizes to 
k 
(6.14) (Ac(0)X,¥) = S© (E,0;0;f(0)) XiYj, 
ij=l 


which yields (6.10). We record this observation. 


Proposition 6.1. Let X : M — R” be an isometric immersion of a Riemannian 
manifold into R". Then M is a minimal submanifold of R” if and only if the 
coordinate functions £1, ..., Zp, are harmonic functions on M. 


A two-dimensional minimal submanifold of R” is called a minimal surface. 
The theory is most developed in this case, and we will concentrate on the two- 
dimensional case in the material below. 

When dim M = 2, we can extend Proposition 6.1 to cases where X : M —> 
IR” is not an isometric map. This occurs because, in such a case, the class of 
harmonic functions on M is invariant under conformal changes of metric. In fact, 
if A is the Laplace operator for a Riemannian metric gj; on M and A, that for 
guij = €F"gi;, then, since Af = g~'/? A;(g*4g*/? 0; f) and gi? = e~2“g"4, while 
gi.” = ek“ g'/2 (if dim M = k), we have 


(6.15) Aif =e7™ Af + e7*" (df, de®-2") =e "Af ifk =2. 
Hence ker A = ker A, if & = 2. We hence have the following: 


Proposition 6.2. [fQ is a Riemannian manifold of dimension 2 and X :Q. — R” 
a smooth immersion, with image M, then M is a minimal surface provided X is 
harmonic and X :Q — M is conformal. 


In fact, granted that X :Q — M is conformal, M is minimal if and only if X 
is harmonic on 22. 

We can use this result to produce lots of examples of minimal surfaces, by the 
following classical device. Take 2 to be an open set in R? = C, with coordinates 
(ui, U2). Given a map X : 2 — R”, with components z; : 2 — R, form the 
complex-valued functions 
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(6.16) ies iF = 2S, C= uy t iu. 


Clearly, ~; is holomorphic if and only if x; is harmonic (for the standard flat 
metric on ), since A = 4(0/0¢)(0/0C). Furthermore, a short calculation gives 


(6.17) S-by(6)? = |X|? — |O2X|? — 21 LX - AX. 
j=1 


Granted that _X : Q — R” is an immersion, the criterion that it be conformal is 
precisely that this quantity vanish. We have the following result. 


Proposition 6.3. [fw1,..., Wn are holomorphic functions on Q C C such that 
(6.18) S73 (6)? =0 onQ, 
j=l 


while >> \w;(C)|? 4 0 on Q, then setting 


(6.19) rj(u) = Re f ¥4(6) dc 
defines an immersion X : Q — R” whose image is a minimal surface. 


If Q is not simply connected, the domain of X is actually the universal covering 
surface of 2. 

We mention some particularly famous minimal surfaces in R°® that arise in such 
a fashion. Surely the premier candidate for (6.18) is 


(6.20) sin? ¢ + cos*¢ —1=0. 
Here, take #1(¢) = sin¢, W2(¢) = —cos¢, and w3(¢) = —7. Then (6.19) yields 
(6.21) x1 = (cosu;)(coshug), x2 = (sinuz)(coshug), 23 = U2. 


The surface obtained in R? is called the catenoid. It is the surface of revolution 
about the w3-axis of the curve 7; = cosh 3 in the (x; — x3)-plane. Whenever 
w;(C) are holomorphic functions satisfying (6.18), so are e’y);(¢), for any 0 € R. 
The resulting immersions X¢g : 22 + R” give rise to a family of minimal surfaces 
Ms Cc R", which are said to be associated. In particular, M,, /2 is said to be 
conjugate to M = Mo. When Mp is the catenoid, defined by (6.21), the conjugate 
minimal surface arises from 71(¢) = 74 sin¢, w2(¢) = —7 cos¢, and w3(¢) = 1 
and is given by 


(6.22) x1 = (sinu)(sinhug), v2 = (cosui)(sinhue), v3 = U4. 
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This surface is called the helicoid. We mention that associated minimal surfaces 
are locally isometric but generally not congruent; that is, the isometry between 
the surfaces does not extend to a rigid motion of the ambient Euclidean space. 
The catenoid and helicoid were given as examples of minimal surfaces by 
Meusnier, in 1776. 
One systematic way to produce triples of holomorphic functions 7,;(¢) satis- 
fying (6.18) is to take 


il 


(6.23) v=; 


f-9), ve=5ft9"), vs= fo, 

for arbitrary holomorphic functions f and g on 2. More generally, g can be 
meromorphic on 22 as long as f has a zero of order 2m at each point where 
g has a pole of order m. The resulting map X : Q > M C R? is called 
the Weierstrass-Enneper representation of the minimal surface M. It has an 
interesting connection with the Gauss map of M/, which will be sketched in the 
exercises. The example arising from f = 1, g = ¢ produces “Enneper’s surface.” 
This surface is immersed in R® but not imbedded. 

For a long time the only known examples of complete imbedded minimal 
surfaces in R® of finite topological type were the plane, the catenoid, and the 
helicoid, but in the 1980s it was proved by [HM1] that the surface obtained by 
taking g = ¢ and f(¢) = (C) (the Weierstrass g-function) is another example. 
Further examples have been found; computer graphics have been a valuable aid 
in this search; see [HM2]. 

A natural question is how general is the class of minimal surfaces arising from 
the construction in Proposition 6.3. In fact, it is easy to see that every minimal 
M C R” is at least locally representable in such a fashion, using the existence of 
local isothermal coordinates, established in § 10 of Chap. 5. Thus any p € MM has 
a neighborhood O such that there is a conformal diffeomorphism X : Q — O, for 
some open set 2. C R?. By Proposition 6.2 and the remark following it, if M is 
minimal, then X must be harmonic, so (6.16) furnishes the functions 7); (¢) used 
in Proposition 6.3. Incidentally, this shows that any minimal surface in R” is real 
analytic. 

As for the question of whether the construction of Proposition 6.3 globally rep- 
resents every minimal surface, the answer here is also “yes.” A proof uses the fact 
that every noncompact Riemann surface (without boundary) is covered by either 
C or the unit disk in C. This is a more complete version of the uniformization 
theorem than the one we established in § 2 of this chapter. A positive answer, for 
simply connected, compact minimal surfaces, with smooth boundary, is implied 
by the following result, which will also be useful for an attack on the Plateau 
problem. 


Proposition 6.4. If M is a compact, connected, simply connected Riemannian 
manifold of dimension 2, with nonempty, smooth boundary, then there exists a 
conformal diffeomorphism 
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(6.24) &:M — D, 
where D = {(r,y) € R*: 27+ y? < 1}. 


This is a slight generalization of the Riemann mapping theorem, established 
in § 4 of Chap. 5, and it has a proof along the lines of the argument given there. 
Thus, fix p € M, and let G € D’'(M) 9 C™(M \ p) be the unique solution to 


(6.25) AG = 276, G=O0on0M. 


Since M is simply connected, it is orientable, so we can pick a Hodge star oper- 
ator, and «dG = (3 is a smooth closed 1-form on M \ p. If 7 is a curve in M 
of degree | about p, then ile (@ can be calculated by deforming y to be a small 
curve about p. The parametrix construction for the solution to (6.25), in normal 
coordinates centered at p, gives G(x) ~ log dist(x,p), and one establishes that 
i (3 = 2r. Thus we can write G = dH, where H is a smooth function on M \ p, 


well defined mod 27Z. Hence 6(x) = e@+*” is a single-valued function, tending 
to 0 as x — p, which one verifies to be the desired conformal diffeomorphism 
(6.24), by the same reasoning as used to complete the proof of Theorem 4.1 in 
Chap. 5. 

An immediate corollary is that the argument given above for the local 
representation of a minimal surface in the form (6.19) extends to a global 
representation of a compact, simply connected minimal surface, with smooth 
boundary. 

So far we have dealt with smooth surfaces, at least immersed in R”. The 
theorem of J. Douglas and T. Rado that we now tackle deals with “generalized” 
surfaces, which we will simply define to be the images of two-dimensional mani- 
folds under smooth maps into R” (or some other manifold). The theorem, a partial 
answer to the “Plateau problem,” asserts the existence of an area-minimizing gen- 
eralized surface whose boundary is a given simple, closed curve in R”. 

To be precise, let 7 be a smooth, simple, closed curve in R”, that is, a diffeo- 
morphic image of $+. Let 


x. = {py € C(D,R") NC™*(D,R") : 


(6.26) : 
yp: 5° — y monotone, and a(y) < oo}, 


where a is the area functional: 


(6.27) aly) = / lArp A Oap| dry dry. 
D 


Then let 


(6.28) A, = inf{a(y) : y € Xy}. 
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The existence theorem of Douglas and Rado is the following: 
Theorem 6.5. There is a map (p € X such that a(ip) = Ay. 

We can choose y, € Xy such that a(y,) \, Ay, but {y,} could hardly be 
expected to have a convergent subsequence unless some structure is imposed on 
the maps y,. The reason is that a(y) = a(y o Ww) for any C’°-diffeomorphism 
w:D— D. We say yo w is a reparameterization of y. The key to success is 


to take y,, which approximately minimize not only the area functional a(y) but 
also the energy functional 


(6.29) O(y) = five@e dx\dx2, 
D 


so that we will also have J(p,) \, dy, where 
(6.30) dy =inf{v(y): p € Xy}. 


To relate these, we compare (6.29) and the area functional (6.27). 
To compare integrands, we have 


(6.31) [Vel? = aryl? + |aael?, 
while the square of the integrand in (6.27) is equal to 


|A.p A doyp|? = |A19|?|O2"|? — (A1¢, O29) 
|A19|7|829)? 


1 
lav? + leavl?)”, 


IA 


(6.32) 


IA 


where equality holds if and only if 
(6.33) lOry| = |O2¢p| and (Oy, O29) =0. 
Whenever Vy # 0, this is the condition that y be conformal. More generally, if 


(6.33) holds, but we allow Vy(a) = 0, we say that ¢ is essentially conformal. 
Thus, we have seen that, for each yp € X,, 


1 
(6.34) a(y) < 7%); 


with equality if and only if y is essentially conformal. The following result allows 
us to transform the problem of minimizing a(y) over X., into that of minimizing 
0(~) over X., which will be an important tool in the proof of Theorem 6.5. Set 
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(6.35) xP =ipe C*(D,R") sy S' — 4 diffeo.}. 
Proposition 6.6. Given e > 0, any p € X}° has a reparameterization y 0 wy such 
that 
1 
(6.36) 5 °(PO¥) S aly) +e. 


Proof. We will obtain this from Proposition 6.4, but that result may not apply 
to y(D), so we do the following. Take 5 > 0 and define ®5 : D 4 R”*? by 
®5(x) = (y(x), dx). For any 5 > 0, ®5 is a diffeomorphism of D onto its 
image, and if 6 is very small, area ®s(D) is only a little larger than area y(D). 
Now, by Proposition 6.4, there is a conformal diffeomorphism V : ®;(D) > D. 
Set p = ws = (Vo 5)" : D + D. Then ®; 0 w = W~! and, as established 
above, (1/2)3(W-!) = Area(W—!(D)), ie., 


(6.37) $0(®5 0) = Area(®;(D)). 
Since V(yow) < V(®5 0 w), the result (6.34) follows if 6 is taken small enough. 


One can show that 
(6.38) A, =inf{a(y):pEeXP}, dy =inf{#(y): y € XP}. 


It then follows from Proposition 6.6 that A, = (1/2)d,, and if p, € X5° is 
chosen so that J(y,) + d,, then a fortiori a(y,) > Ay. 

There is still an obstacle to obtaining a convergent subsequence of such {y,}. 
Namely, the energy integral (6.29) is invariant under reparameterizations y +> 
yo for which 7) : D + D is a conformal diffeomorphism. We can put a clamp 
on this by noting that, given any two triples of (distinct) points {p1,p2,p2} and 
{q1, 42, 93} in $+ = OD, there is a unique conformal diffeomorphism 7) : D + D 
such that ¢(p;) = qj,1 < j < 3. Let us now make one choice of {p;} on S'— 
for example, the three cube roots of 1—and make one choice of a triple {q;} of 


FIGURE 6.1 Annular Region in the Disk 
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distinct points in y. The following key compactness result will enable us to prove 
Theorem 6.5. 


Proposition 6.7. For any d € (d,, 00), the set 
(6.39) Ya={ye X : y harmonic, y(p;) = qj, and Vp) < d} 
is relatively compact in C(D,R"). 


In view of the mapping properties of the Poisson integral, this result is equiva- 
lent to the relative compactness in C(OD, y) of 


(6.40) Sx = {ue C%(S", 7) diffeo. : u(p;) = qj, and ||ull z1/2(91) < K}, 


for any given K < oo. For u € Sx, we have ||ul| q1/2(91) © ||P ull x(a). To 
demonstrate this compactness, there is no loss of generality in taking y = S' C 
R? and Pj = Q- 

We will show that the oscillation of u over any arc I C S!' of length 26 is 
< CK /,\/log(1/6). This modulus of continuity will imply the compactness, by 
Ascoli’s theorem. 

Pick a point z € S*. Let C, denote the portion of the circle of radius r and 
center z which lies in D. Thus C,, is an arc, of length < mr. Let 6 € (0,1). Asr 
varies from 6 to V6, C;. sweeps out part of an annulus, as illustrated in Fig. 6.1. 

We claim there exists p € [6, V5] such that 


27 
log 


(6.41) [ive ds <K 


i 
Cy 2 


if K = ||V¢||z2(p), ¢ = Plu. To establish this, let 


Oe a Vel? ds. 


Cr 


vo ay v5 
| u(r) 2 = f [iver ds dr =I < K?. 
6 ‘ a 


By the mean-value theorem, there exists p € [0, V6] such that 


V5 
r=u(p) [ oA) og f 
6 i 


Then 


For this value of p, we have 
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21 2k? 
< 


log + ~ log < 


(6.42) | |Vy|? ds = 
Cp 


Then Cauchy’s inequality yields (6.41), since length(C’,) < mp. 

This almost gives the desired modulus of continuity. The arc C, is mapped 
by y into a curve of length < K\/27/log(1/6), whose endpoints divide + into 
two segments, one rather short (if 6 is small) and one not so short. There are two 
possibilities: (z) is contained in either the short segment (as in Fig. 6.2) or the 
long segment (as in Fig. 6.3). However, as long as y(p;) = p; for three points p,, 
this latter possibility cannot occur. We see that 


lu(a) — u(b)| < Ky, 
Og 5 


if a and b are the points where C,, intersects S!. Now the monotonicity of wu along 
S$! guarantees that the total variation of u on the (small) arc from a to b in S$" is 


< K,/27/log(1/6). This establishes the modulus of continuity and concludes 


the proof. 

Now that we have Proposition 6.7, we proceed as follows. Pick a sequence y,, 
in X5° such that J(~,) — d,, so also a(y,) — A,. Now we do not increase 
O(y_) if we replace vy, by the Poisson integral of y.| ap: and we do not alter this 


energy integral if we reparameterize via a conformal diffeomorphism to take {p, } 
to {q;}. Thus we may as well suppose that y, € Lg. Using Proposition 6.7 and 
passing to a subsequence, we can assume 

(6.43) ~Yy—y inC(D,R"), 

and we can furthermore arrange 


(6.44) Yy —>y weakly in H'(D,R"). 


Of course, by interior estimates for harmonic functions, we have 
b 
Pp 
2 . ‘ ° 9(z) 
-_ ey 
a 


FIGURE 6.2 Mapping of an Arc 
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P2 9 9(z) 
a 
a 
Py we, 


FIGURE 6.3 Alternative Mapping of an Arc 


(6.45) ~y —y inC*(D,R”). 

The limit function y is certainly harmonic on D. By (6.44), we of course have 
(6.46) Dy) < jim (yr) = dy. 

Now (6.34) applies to y, so we have 

(6.47) aly) < 50) < 


On the other hand, (6.43) implies that ¢ : 0D — y is monotone. Thus ¢y belongs 
to X,. Hence we have 


(6.48) aly) = A, 
This proves Theorem 6.5 and most of the following more precise result. 


Theorem 6.8. [fy is a smooth, simple, closed curve in IR”, there exists a contin- 
uous map yp : D — R” such that 


(6.49) B(y) =d, and aly) = A,, 
(6.50) y : D —+ R” is harmonic and essentially conformal, 
(6.51) y:S! —+ 4, homeomorphically. 


Proof. We have (6.49) from (6.46)—(6.48). By the argument involving (6.31) and 
(6.32), this forces vy to be essentially conformal. It remains to demonstrate (6.51). 

We know that y : S' > y, monotonically. If it fails to be a homeomorphism, 
there must be an interval J C S$‘ on which y is constant. Using a linear fractional 
transformation to map D conformally onto the upper half-plane Qt C C, we can 
regard » as a harmonic and essentially conformal map of 2+ — R”, constant 
on an interval J on the real axis R. Via the Schwartz reflection principle, we can 
extend y to a harmonic function 
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p:C\(R\D— R". 


Now consider the holomorphic function ~ : C \ (R \ I) > C”, given by ~(¢) = 
Op /OC. As in the calculations leading to Proposition 6.3, the identities 


(6.52) \Ay|? —|O2y/? =0, Ayp- Hp =0, 


which hold on 2*, imply }97"_, (¢)* = 0 on Q*; hence this holds on C\(R\J), 
and so does (6.52). But since 0;y = 0 on J, we deduce that 02y = 0 on J, 
hence ~ = 0 on J, hence 7 = 0. This implies that y, being both R”-valued 
and antiholomorphic, must be constant, which is impossible. This contradiction 
establishes (6.51). 


Theorem 6.8 furnishes a generalized minimal surface whose boundary is a 
given smooth, closed curve in R”. We know that y is smooth on D. It has been 
shown by [Hild] that y is C°° on D when the curve ¥ is C®, as we have assumed 
here. It should be mentioned that Douglas and others treated the Plateau problem 
for simple, closed curves 7 that were not smooth. We have restricted attention to 
smooth + for simplicity. A treatment of the general case can be found in [Nit1]; 
see also [Nit2]. 

There remains the question of the smoothness of the image surface M = y(D). 
The map y : D — R” would fail to be an immersion at a point z € D where 
Vy(z) = 0. At such a point, the C”-valued holomorphic function ) = Op/0¢ 
must vanish; that is, each of its components must vanish. Since a holomorphic 
function on D C C that is not identically zero can vanish only on a discrete set, 
we have the following: 


Proposition 6.9. The map y : D > R” parameterizing the generalized minimal 
surface in Theorem 6.8 has injective derivative except at a discrete set of points 
in D. 


If Vy(z) = 0, then y(z) € M = vy(D) is said to be a branch point of the 
generalized minimal surface 17; we say M is a branched surface. If n > 4, there 
are indeed generalized minimal surfaces with branch points that arise via Theorem 
6.8. Results of Osserman [Oss2], complemented by [Gul], show that if n = 3, the 
construction of Theorem 6.8 yields a smooth minimal surface, immersed in R?. 
Such a minimal surface need not be imbedded; for example, if 7 is a knot in R°, 
such a surface with boundary equal to + is certainly not imbedded. If y is analytic, 
it is known that there cannot be branch points on the boundary, though this is open 
for merely smooth y. An extensive discussion of boundary regularity is given in 
Vol. 2 of [DHKW]. 

The following result of Rado yields one simple criterion for a generalized min- 
imal surface to have no branch points. 


Proposition 6.10. Let y be a smooth, closed curve in R”. If a minimal surface 
with boundary yy produced by Theorem 6.8 has any branch points, then y has the 
property that 
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or some p € IR", every hyperplane through 
(6.53) fe Pp ry hyperp ghp 


intersects ‘y in at least four points. 


Proof. Suppose z) € D and Vy(zo) = 0, so W = Oy/OC vanishes at zp. Let 
L(x) = a-a-+c = 0 be the equation of an arbitrary hyperplane through p = (zo). 
Then h(x) = L(y(a)) is a (real-valued) harmonic function on D, satisfying 


(6.54) Ah=0onD, Vah(z) =0. 
The proposition is then proved, by the following: 


Lemma 6.11. Any real-valued h € C®°(D)M C(D) having the property (6.54) 
must assume the value h( zo) on at least four points on OD. 


We leave the proof as an exercise for the reader. 
The following result gives a condition under which a minimal surface con- 
structed by Theorem 6.8 is the graph of a function. 


Proposition 6.12. Let O be a bounded convex domain in R? with smooth 
boundary. Let g : OO — IR"~? be smooth. Then there exists a function 


(6.55) f<eC™(O,R”) nC(O.R* *), 


whose graph is a minimal surface, and whose boundary is the curve y C R” that 
is the graph of g, so 


(6.56) f=g onoo. 


Proof. Let y : D + R” be the function constructed in Theorem 6.8. Set F(x) = 
(yi(x), y2(x)). Then F : D —> R? is harmonic on D and F maps St = 0D 
homeomorphically onto 0O. It follows from the convexity of O and the maximum 
principle for harmonic functions that F : D + O. 

We claim that DF'(x) is invertible for each x € D. Indeed, if x9 € D and 
DF (ao) is singular, we can choose nonzero ~@ = (a1,Q2) € R? such that, at 
v= XO, 9 9 

1 p2 : 
"a, wie og, =0, j=1,2. 
Then the function h(x) = a y1(x%) + agyo(x) has the property (6.54), so h(x) 
must take the value h(x) at four distinct points of OD. Since F : 0D + 00 is 
a homeomorphism, this forces the Jinear function a,x 1 + a2%2 to take the same 
value at four distinct points of OO, which contradicts the convexity of O. 

Thus fF’ : D — QO isa local diffeomorphism. Since F’ gives a homeomorphism 
of the boundaries of these regions, degree theory implies that F’ is a diffeomor- 
phism of D onto © and a homeomorphism of D onto ©. Consequently, the 
desired function in (6.55) is f = Go F—', where G(x) = (2(z),.-.,Yn(z)). 
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Functions whose graphs are minimal surfaces satisfy a certain nonlinear PDE, 
called the minimal surface equation, which we will derive and study in § 7. 

Let us mention that while one ingredient in the solution to the Plateau problem 
presented above is a version of the Riemann mapping theorem, Proposition 6.4, 
there are presentations for which the Riemann mapping theorem is a consequence 
of the argument, rather than an ingredient (see, e.g., [Nit2]). 

It is also of interest to consider the analogue of the Plateau problem when, 
instead of immersing the disk in R” as a minimal surface with given boundary, 
one takes a surface of higher genus, and perhaps several boundary components. 
An extra complication is that Proposition 6.4 must be replaced by something more 
elaborate, since two compact surfaces with boundary which are diffeomorphic to 
each other but not to the disk may not be conformally equivalent. One needs to 
consider spaces of “moduli” of such surfaces; Theorem 4.2 of Chap. 5 deals with 
the easiest case after the disk. This problem was tackled by Douglas [Dou2] and 
by Courant [Cou2], but their work has been criticized by [ToT] and [Jos], who 
present alternative solutions. The paper [Jos] also treats the Plateau problem for 
surfaces in Riemannian manifolds, extending results of [Mor1]. 

There have been successful attacks on problems in the theory of minimal 
submanifolds, particularly in higher dimension, using very different techniques, 
involving geometric measure theory, currents, and varifolds. Material on these 
important developments can be found in [Alm, Fed, Morg]. 

So far in this section, we have devoted all our attention to minimal submani- 
folds of Euclidean space. It is also interesting to consider minimal submanifolds 
of other Riemannian manifolds. We make a few brief comments on this topic. 
A great deal more can be found in [Cher, Law, Law2, Morl1, Pi] and in survey 
articles in [Bom]. 

Let Y be a smooth, compact Riemannian manifold. Assume Y is isometrically 
imbedded in R”, which can always be arranged, by Nash’s theorem. Let M/ be a 
compact, k-dimensional submanifold of Y. We say M is a minimal submanifold 
of Y if its k-dimensional volume is a critical point with respect to small variations 
of M, within Y. The computations in (6.1)—(6.13) extend to this case. We need to 
take X = X(s,u) with 0,X(s,u) = €(s, uw), tangent to Y, rather than X(s, u) = 
Xo(u) + s€(u). Then these computations show that M is a minimal submanifold 
of Y if and only if, foreach x € M, 


(6.57) H(x) 4. TY, 


where $)(x) is the mean curvature vector of M (as a submanifold of R”), defined 
by (6.13). 

There is also a well-defined mean curvature vector Hy (x) € TY, orthogonal 
to TM, obtained from the second fundamental form of M as a submanifold 
of Y. One sees that Hy (a) is the orthogonal projection of §(a) onto TY, so the 
condition that M/ be a minimal submanifold of Y is that Hy = 0 on M. 

The formula (6.10) continues to hold for the isometric imbedding X : M —- 
R”. Thus MM is a minimal submanifold of Y if and only if, for each « € M, 
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(6.58) AX (x) 1 T,Y. 


If dim M = 2, the formula (6.15) holds, so if M is given a new metric, confor- 
mally scaled by a factor e?“, the new Laplace operator A; has the property that 
A,X = e 7¥AX, hence is parallel to AX. Thus the property (6.58) is unaf- 
fected by such a conformal change of metric; we have the following extension of 
Proposition 6.2: 


Proposition 6.13. If M is a Riemannian manifold of dimension 2 and X : M — 
R” is a smooth imbedding, with image M, CY, then M, is a minimal submani- 
fold of Y provided X : M — My, is conformal and, for each x € M, 


(6.59) AX (ax) L Tx (ay: 


We note that (6.59) alone specifies that X is a harmonic map from M into Y. 
Harmonic maps will be considered further in §$ 11 and 12B; they will also be 
studied, via parabolic PDE, in Chap. 15, § 2. 


Exercises 


1. Consider the Gauss map N : M — S?, for a smooth, oriented surface M Cc R°. 
Show that NV is antiholomorphic if and only if M is a minimal surface. 
(Hint: If N(p) = q, DN(p) : TpM — T,S? & TpM is identified with — Aw. Com- 
pare (4.67) in Appendix C. Check when Ay J = —JAn, where J is counterclockwise 
rotation by 90°, on TM.) Thus, if we define the antipodal Gauss map N : M — S? 
by N(p) = —N(p), this map is holomorphic precisely when M is a minimal surface. 

2. Ifx € S? c R’, pick v € T,S? C R°, setw = Jv € T,S” C R?, and take 
€ =v+iw € C?. Show that the one-dimensional, complex span of € is independent 
of the choice of v, and that we hence have a holomorphic map 


=: 9? — cp’. 


Show that the image =(S”) C CP® is contained in the image of {¢ ¢ C?\0: 
¢? + C3 + C3 = 0} under the natural map C? \ 0 > CP?. 

3. Suppose that M/ C R? is a minimal surface constructed by the method of Proposition 
6.3, via X :Q— M C R®. Define U : O > C?\ 0 by U = (a1, Ye, Ws), and define 
X : Q — CP? by composing W with the natural map C? \ 0 > CP*. Show that, for 
ue Q, _ 

X(u) = Zo N(X(u)). 

For the relation between ~; and the Gauss map for minimal surfaces in R", n > 3, 
see [Law]. 

4. Give a detailed demonstration of (6.38). 

5. In analogy with Proposition 6.4, extend Theorem 4.3 of Chap. 5 to the following result: 


Proposition. [f M is a compact Riemannian manifold of dimension 2 which is 
homeomorphic to an annulus, then there exists a conformal diffeomorphism 


Uv: M — &Q,, 
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10. 


for aunique p € (0,1), where A, ={z EC: p< |z| <1}. 


. If IL is the second fundamental form of a minimal hypersurface 1M C R"”, show 


that [7 has divergence zero. As in Chap. 2, § 3, we define the divergence of a second- 
order tensor field T by T?*.;,. (Hint: Use the Codazzi equation (cf. Appendix C, § 4, 
especially (4.18)) plus the zero trace condition.) 


. Similarly, if 7 is the second fundamental form of a minimal submanifold M of codi- 


mension 1 in S” (with its standard metric), show that TI has divergence zero. 
(Hint: The Codazzi equation, from (4.16) of Appendix C, is 


(Vy T1)(X, Z) - (Vy TD(Y, Z) = (R(X, Y)Z,N), 


where V is the Levi—Civita connection on M; X,Y, Z are tangent to M; Z is normal 
to M (but tangent to S”); and R is the curvature tensor of S”. In such a case, the right 
side vanishes. (See Exercise 6 in § 4 of Appendix C.) Thus the argument needed for 
Exercise 6 above extends.) 


. Extend the result of Exercises 6—7 to the case where M is a codimension-! minimal 


submanifold in any Riemannian manifold Q with constant sectional curvature. 


. Let M be a two-dimensional minimal submanifold of S°, with its standard metric. 


Assume MM is diffeomorphic to S 2 Show that M must be a “great sphere” in S®. 
(Hint: By Exercise 7, II is a symmetric trace free tensor of divergence zero; that is, 
IT belongs to 

V = {u € C™(M, SGT”) : div u = 0}, 
a space introduced in (10.47) of Chap. 10. As noted there, when M is a Riemann 
surface, V & O(«K @ &). By Corollary 9.4 of Chap. 10, O(K @ «%) = 0 when M has 
genus g = 0.) 
Prove Lemma 6.11. 


6B. Second variation of area 


In this appendix to § 6, we take up a computation of the second variation of the 
area integral, and some implications, for a family of manifolds of dimension k, 
immersed in a Riemannian manifold Y. First, we take Y = R” and suppose the 
family is given by X(s, u) = Xo(u) + s&(u), as in (6.1)-(6.5). 


Suppose, as in the computation (6.2)-(6.5), that ||0:Xo0 A--- A O,Xo|| = 1 


on M, while E; = 0;Xo form an orthonormal basis of T,,/, for a given point 
x € M. Then, extending (6.3), we have 


(6b.1) A'(s) = 


ene eee) 
j=l |]O.X A+++ A Op X|| 


Consequently, A” (0) will be the integral with respect to du; --- du, of a sum of 
three terms: 
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(6b.2) 
— SU (AL Xo A+ NOE N+ ++ A OX, 1X0 A+++ A eX) 


x (O,Xo A+++ NOE A+++ A OpX0, 1X0 A+++ A OX) 


+25 (OXON NOE NN OVEN +++ AN OpX0, 1X0 A+++ A OpX0) 
1<j 

+ S(OXo N+ NOE N +++ NOpX0, AX N+ NOE A+++ NA OpX0). 
ij 


Let us write 


(6b.3) AcE; = > af Ex, 
e 


with EL; = 0;Xo as before. Then, as in (6.4), the first sum in (6b.2) is equal to 


(6b.4) - 2 aga}. 


Let us move to the last sum in (6b.2). We use the Weingarten formula 0;€ = 
Vjé — AcE;, to write this sum as 


(6b.5) eS = — (ViE, Vie), 


at x. Note that the first sum in (6b.5) cancels (6b.4), while the last sum in (6b.5) 
can be written as ||V+€||?. Here, V+ is the connection induced on the normal 
bundle of M/. 

Now we look at the middle term in (6b.2), namely, 


(66.6) 25° S > ag’at™ (By A+++ A Ep A+++ \ Em A+++ \ Ex, By N+++ A Eg), 


i<j l,m 


at x, where Ez appears in the 7th slot and E,,, appears in the jth slot in the k-fold 
wedge product. This is equal to 


(6b.7) 3 ba agial? — ai a) = 2 Tr A? Ag, 


<j 


at x. Thus we have 


(6b.8) A" (0) = J [iver +2Tr A? Ag] dA(2). 
M 
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If M is a hypersurface of R”, and we take € = f N, where N is a unit normal 
field, then ||V1€||? = ||Vf||? and (6b.7) is equal to 


(6b.9) 2) (R(E;, ,) Ei, Ej) f? = Sf, 
<j 


by the Theorema Egregium, where S is the scalar curvature of /. Consequently, 
if 1 C R” isa hypersurface (with boundary), and the hypersurfaces /, are given 
by (6.6), with area integral (6.2), then 


(6b.10) A’(0) = J [iver + S(ax)f?| dA(z). 
M 


Recall that when dim M = 2,so M Cc R?°, 
(6b.11) S =2K, 


where KC is the Gauss curvature, which is < 0 whenever M is a minimal surface 
in R°. 
If / has general codimension in R”, we can rewrite (6b.8) using the identity 


(6b.12) 2 Tr A? Ag = (Tr Ag)? — ||Aell?, 
where ||A¢|| denotes the Hilbert-Schmidt norm of Ae, that is, 
|| Agl]? = Tr(Ag Ae). 
Recalling (6.13), if k = dim M, we get 
(66.13) AMO) = f [I VE? = Ae? + (8 (2),€)°] AC). 
M 


Of course, the last term in the integrand vanishes for all compactly supported 
fields € normal to M when M is a minimal submanifold of R”. 

We next suppose the family of manifolds M/, is contained in a manifold 
Y Cc R”. Hence, as before, instead of X(s,u) = Xo(u) + sé(u), we require 
O,X(s,u) = €(s,u) to be tangent to Y. We take X(0,u) = Xo(u). Then (6b.1) 
holds, and we need to add to (6b.2) the following term, in order to compute A’’ (0): 


k 
= O1Xo9 A+++ AN OjK A+++ A OpX0, 1X0 A+++ A OnXo), 
(6b.14) 2 1440 jh kAA0, C140 k; 0) 
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If, as before, 0; Xo = E; form an orthonormal basis of T;,.M, for a given x € M, 
then 


k 
(6b.15) = Ss (O;K,E;), ata. 
j=l 


Now, given the compactly supported field €(0, u), tangent to Y and normal to 
M, let us suppose that, for each u, Yu(s) = X(s, u) is a constant-speed geodesic 
in Y, such that 7/,(0) = €(0,u). Thus & = 7//(0) is normal to Y, and, by the 
Weingarten formula for 1 Cc R”, 


(6b.16) O;K = Vik — A,E;, 


at x, where V! is the connection on the normal bundle to M@ C R” and A is as 
before the Weingarten map for 1/7 Cc R”. Thus 


(6b.17) 6 =—) (A, Bj, Bj) = —Tr Ay = —k(9(2), 4), 
j 
where k = dim M. 
If we suppose M is a minimal submanifold of Y, then §(2) is normal to 


Y, so, for any compactly supported field €, normal to M and tangent to Y, the 
computationss (6b.13) supplemented by (6b.14)—(6b.17) gives 


(66.18) AN(0) = f [IV*EI? = [Ae — (8(2), 8)] AC. 
M 


Recall that A¢ is the Weingarten map of M C R”. 
We prefer to use Bg, the Weingarten map of MV C Y. It is readily verified that 


(6b.19) Ag = Be € End T;M 


if€ € T,Y and € 1 T;,M; see Exercise 13 in § 4 of Appendix C. Thus in (6b.18) 
we can simply replace || A¢||* by ||Be||?. Also recall that V' in (6b.18) is the 
connection on the normal bundle to 14 C R”. We prefer to use the connection 
on the normal bundle to M C Y, which we denote by V*. To relate these two 
objects, we use the identities 


dj, = ViE—AgE;, O)€ = VjE+ LY (Ej,8), 


(6b.20) os 4 
Vie = VEE - BeEy, 


where Vv denotes the covariant derivative on Y, and II is the second fundamen- 
tal form of Y C R”. In view of (6b.19), we obtain 
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(6b.21) Vie = Vie + II” (Ej, 8), 
a sum of terms tangent to Y and normal to Y, respectively. Hence 


(6b.22) IV7El? = VE? + 5° I (EB; O12. 
J 


Thus we can rewrite (6b.18) as 


(66.23) A"(0) = f [I VE? — Bel? + ILE” (B;,8)IP — TAs] AAC). 
j 


M 


We want to replace the last two terms in this integrand by a quantity defined 
intrinsically by MM, C Y, not by the way Y is imbedded in R”. Now Tr A, = 
> UI™ (Ej, Ej), %), where [I™ is the second fundamental form of M Cc R”. 
On the other hand, it is easily verified that 
(6b.24) k= (0) = II” (E,€). 


Thus the last two terms in the integrand sum to 


(6.25) W = S| IL” (,€)|? — "(6 6), (B;,B;))]. 
j 


We can replace II” (Ej, Ej) by II* (Ej, E;) here, since these two objects have 
the same component normal to Y. Then Gauss’ formula implies 


(6b.26) v= ae, (€, E;)E, E;), 


where RY is the Riemann curvature tensor of Y. We define ® € End N,M, 
where N(M) is the normal bundle of N c Y, by 


(6b.27) (R(E),n) = (RY (€, Ej), Ey), 


at x, where {£;} is an orthonormal basis of T,,/. It follows easily that this is 
independent of the choice of such an orthonormal basis. 
Our calculation of A’’(0) becomes 


(6b.28) A'(0) = / [Iv*el? — || Bell? + (R(®),8)| dA(zx) 


M 
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when M is a minimal submanifold of Y, where V* is the connection on the 
normal bundle to M C Y, Bis the Weingarten map for M C Y, and % is defined 
by (6b.27). If we define a second-order differential operator £9 and a zero-order 
operator B on C§° (IM, N(M)) by 
(6b.29) Lok = (VF VRE, (B(E),n) = Tr(By Be), 
respectively, we can write this as 
(6b.30) A"(0) = (£€,4)n20), LE = Lof — BE) + HEE). 

We emphasize that these formulas, and the ones below, for A” (0) are valid for 
immersed minimal submanifolds of Y as well as for imbedded submanifolds. 


Suppose that (7 has codimension | in Y and that Y and ™ are orientable. 
Complete the basis {;} of T,,M to an orthonormal basis 


{Bj:1<j<k+1} 
of TY. In this case, F,41(a) and €(x) are parallel, so 
(RY (€, Ex+1)n, xsi) = 0. 
Thus (6b.27) becomes 
(6b.31) R(€) = —Ric’€ ifdimY = dim M +1, 


where Ric” denotes the Ricci tensor of Y. In such a case, taking € = fEpii1 = 
fv, where v is a unit normal field to /, tangent to Y, we obtain 


A™0) =f [IVA = (IBUIP + Ric” v,»)) [FP] dA(c) 


(6b.32) ce 
= (Lf, f)ra); 
where 
(6b.33) Lf=—-Af+ef, p=-||BLI? — Ric*y,v). 


We can express y in a different form, noting that 


k 
(6b.34) (Ric v,v) = SY — }"(RicY E;, E;), 


j=1 
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where S* is the scalar curvature of Y. From Gauss’ formula we readily obtain, 
for general M Cc Y of any codimension, 


(Ric’ Ej, E;) = (RY (E;,v)v, E;) + (Ric™ E;, E;) 


(6b.35) + S° ||I1(;, Ee)||? — k( Sy, I(E;, ;)), 
£ 


where JJ denotes the second fundamental form of 14 Cc Y. Summing over 1 < 
j <k, when M has codimension | in Y, and v is a unit normal to M, we get 


(6b.36) 2(Ric’ v,v) = SY — S™ — ||B,]|? + ||Sy|/?. 


If M is a minimal submanifold of Y of codimension 1, this implies that 


A ig 1 
y= 5 5°) 5B? 
(6b.37) i 

= 5s =" eA 2, 


We also note that when dim M = 2 and dim Y = 3, then, for x € M, 
(6b.38) Tr A*B, (2) = K™ (x) — K* (T,,.M), 


where K™ = (1/2)S™ is the Gauss curvature of M and KY (T,M) is the 
sectional curvature of Y, along the plane T,, M. 

We consider another special case, where dim M = 1. We have (R(E€),£) = 
—|€|?K* (Ime), where KY (I,s¢) is the sectional curvature of Y along the plane 
in T,Y spanned by T’,M and €. In this case, to say M is minimal is to say it is 
a geodesic; hence Be = 0 and Vi#é = VE, where V is the covariant derivative 
on Y, and T is a unit tangent vector to M. Thus (6b.28) becomes the familiar 
formula for the second variation of arc length for a geodesic: 


(6b.39) £"(0) = | [Irel? - gP?K* (ILye)] ds, 
ay. 


where we have used ¥ instead of M to denote the curve, and also @ instead of A 
and ds instead of dA, to denote arc length. 

The operators £ and L are second-order elliptic operators that are self-adjoint, 
with domain H?(M), if M is compact and without boundary, and with domain 
H?(M) 7 H4(M), if M is compact with boundary. In such cases, the spectra of 
these operators consist of eigenvalues A; _/“ +-oo. If M is not compact, but B and 
RK are bounded, we can use the Friedrichs method to define self-adjoint extensions 
£ and L, which might have continuous spectrum. 


6B. Second variation of area 177 


We say a minimal submanifold M C Y is stable if A’’(0) > 0 for all smooth, 
compactly supported variations €, normal to M/ (and vanishing on 0M). Thus the 
condition that / be stable is that the spectrum of £ (equivalently, of L, if codim 
M = 1) be contained in [0, 00). In particular, if MM is actually area minimizing 
with respect to small perturbations, leaving OM fixed (which we will just call 
“area minimizing”), then it must be stable, so 


(6b.40) M area minimizing => spec £ C [0, 00). 


The second variational formulas above provide necessary conditions for a 
minimal immersed submanifold to be stable. For example, suppose /V/ is a bound- 
aryless, codimension-! minimal submanifold of Y, and both are orientable. Then 
we can take f = 1 in (6b.32), to get 


(6b.41) M stable => f (we + (Ric” v, v)) dA <0. 
M 


If dim M = 2 and dim Y = 3, then, by (6b.37), we have 


(6b.42) M stable —=> f (ve +s" — aK) dA <0. 
M 


In this case, if M has genus g, the Gauss—Bonnet theorem implies that 
f K™ dA = 4n(1— 4g), so 


(6b.43) M stable —> f (WP as s¥) dA < 8r(1— 4). 


M 


This implies some nonexistence results. 


Proposition 6b.1. Assume that Y is a compact, oriented Riemannian manifold 
and that Y and M have no boundary. 

If the Ricci tensor Ric* is positive-definite, then Y cannot contain any com- 
pact, oriented, area-minimizing immersed hypersurface M. If Ric’ is positive- 
semidefinite, then any such M would have to be totally geodesic in Y . 

Now assume dim Y = 3. If Y has scalar curvature SY > 0 everywhere, then 
Y cannot contain any compact, oriented, area-minimizing immersed surface M 
of genus g = 1. 

More generally, if SY > 0 everywhere, and if M is a compact, oriented, 
immersed hypersurface of genus g => 1, then for M to be area minimizing it 
is necessary that g = 1 and that M be totally geodesic in Y . 


R. Schoen and S.-T. Yau [SY] obtained topological consequences for a com- 
pact, oriented 3-manifold Y from this together with the following existence 
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theorem. Suppose M is a compact, oriented surface of genus g > 1, and sup- 
pose the fundamental group 7(Y) contains a subgroup isomorphic to 7(M). 
Then, given any Riemannian metric on Y, there is a smooth immersion of 
into Y which is area minimizing with respect to small perturbations, as shown in 
[SY]. It follows that if Y is a compact, oriented Riemannian 3-manifold, whose 
scalar curvature S* is everywhere positive, then 7,(Y) cannot have a subgroup 
isomorphic to 7(M/), for any compact Riemann surface M of genus g > 1. 

We will not prove the result of [SY] on the existence of such minimal immer- 
sions. Instead, we demonstrate a topological result, due to Synge, of a similar 
flavor but simpler to prove. It makes use of the second variational formula (6b.39) 
for arc length. 


Proposition 6b.2. If Y is a compact, oriented Riemannian manifold of even 
dimension, with positive sectional curvature everywhere, then Y is simply 
connected. 


Proof. It is a simple consequence of Ascoli’s theorem that there is a length- 
minimizing, closed geodesic in each homotopy class of maps from S' to Y. Thus, 
if 7:(Y) # 0, there is a nontrivial stable geodesic, 7. Pick p € 7, € normal to 
7 at p (i.e., &) € Np(y)), and parallel translate € about 7, obtaining €,, € Np(y) 
after one circuit. This defines an orientation-preserving, orthogonal, linear trans- 
formation T : Nyy — Npy. If Y has dimension 2k, then N,v has dimension 
2k — 1, so 7 € SO(2k — 1). It follows that 7 must have an eigenvector in Np, 
with eigenvalue 1. Thus we get a nontrivial, smooth section € of N (+) which is 
parallel over y, so (6b.39) implies 


(6b.44) / K* (Tye) ds < 0. 
Y 


If KY (I) > 0 everywhere, this is impossible. 


One might compare these results with Proposition 4.7 of Chap. 10, which states 
that if Y is a compact Riemannian manifold and Ric’ > 0, then the first coho- 
mology group H!(Y) = 0. 


7. The minimal surface equation 


We now study a nonlinear PDE for functions whose graphs are minimal surfaces. 
We begin with a formula for the mean curvature of a hypersurface M Cc R"+! 
defined by u(x) = c, where Vu # 0 on M. If N = Vu/|Vul, we have the 
formula 


(7.1) (Ay X,Y) = —|Vu|71(D?u)(X,Y), 
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for X,Y € TM, as shown in (4.26) of Appendix C. To take the trace of the 
restriction of D?u to T,M, we merely take Tr(D?u) — D?u(N, N). Of course, 
Tr(D?u) = Au. Thus, for z € M, 


(72) Tr Ay(z) = —|Vu(x)|7? [Au —|Vul-2D?u(Vu, Vu) |. 
Suppose now that /V is given by the equation 


Insi= f(a’), a =(x21,...,2n). 


Thus we take u(x) = %n4i — f(a’), with Vu = (—Vf,1). We obtain for the 
mean curvature the formula 


(3) nila) =—7eae [IVA AL— DUVET] = MCN, 


where (Vf)? = 1+|Vf(a’)|?. Written out more fully, the quantity in brackets 
above is 


Pf of Of _ 


x5 OX j Ox; Ox; 7 


(7.4) (A+ IVFP)AF- S05 M(f). 
ij 


Thus the equation stating that a hypersurface x,41 = f(a’) be a minimal sub- 
manifold of R"*? is 


(7.5) M(f) =0. 
In case n = 2, we have the minimal surface equation, which can also be written as 
(7.6) (1+ |dof|?) Of — 2(Of - Of) Adof + (1+ |O.f|?) Ff =0. 


It can be verified that this PDE also holds for a minimal surface in R” described 
by 2” = f(a’), where x” = (ax3,...,%n), if (7.6) is regarded as a system of k 
equations in k unknowns, k = n — 2, and (0, f - 02f) is the dot product of R*- 
valued functions. We continue to denote the left side of (7.6) by M (f). 

Proposition 6.12 can be translated immediately into the following existence 
theorem for the minimal surface equation: 


Proposition 7.1. Let O be a bounded, convex domain in R? with smooth bound- 
ary. Let g € C®(0O, R*) be given. Then there is a solution 


(7.7) u € C(O, R*)N C(O, R*) 
to the boundary problem 


(7.8) M(u) =0, ulso =9- 
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When k = 1, we also have uniqueness, as a consequence of the following: 


Proposition 7.2. Let O be any bounded domain in R”. Let u; € C(O) C(O) 
be real-valued solutions to 


(7.9) M(u;) =0, uj; = gj on OO, 
for j =1,2. Then 
(7.10) 91 < gg on OO => uy < ug onO. 


Proof. We prove this by deriving a linear PDE for the difference v = ug — u1 
and applying the maximum principle. In general, 


1 
(7.11) P(u2) — O(u;)= Lv, L= | D® (rug +(1- T)ur) dr. 
0 


Suppose ® is a second-order differential operator: 

(7.12) G(u) = F(u,du,07u), F = F(u,p,¢). 

Then, as in (3.4), 

(7.13) D®(u) = Fe(u, Ou, 0?u) 0?u + F,(u, Ou, 0?u) Ov + Fy (u, Ou, Ou). 


When ®(u) = M(u) is given by (7.4), F.,(u, €,¢) = 0, and we have 


(7.14) DM(u)v = A(u)v + B(u)v, 
where 
Ou Ou 0?v 
71 A =(1 2VA 
(7.15) (u)u ( + |Vul ) v 2 85, ae Ede 


is strongly elliptic, and B(u) is a first-order differential operator. Consequently, 
we have 


(7.16) M(uz) — M(u1) = Av + Bo, 


where A = So A(rug +(1- T)uz) dr is strongly elliptic of order 2 at each point 
of O, and B is a first-order differential operator, which annihilates constants. If 
(7.9) holds, then Av + Bu = 0. Now (7.10) follows from the maximum principle, 
Proposition 2.1 of Chap. 5. 

We have as of yet no estimates on |Vu,;(a)| as x ++ OO, so A, which is elliptic 
in O, could conceivably degenerate at OO. To achieve a situation where the results 
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of Chap. 5, § 2, apply, we could note that the hypotheses of Proposition 7.2 imply 
that, for any ¢ > 0, uy < ug+e ona neighborhood of OO. Alternatively, one can 
check that the proof of Proposition 2.1 in Chap. 5 works even if the elliptic oper- 
ator is allowed to degenerate at the boundary. Either way, the maximum principle 
then applies to yield (7.10). 


While Proposition 7.2 is a sort of result that holds for a large class of second- 
order, scalar, elliptic PDE, the next result is much more special and has interesting 
consequences. It implies that the size of a solution to the minimal surface equation 
(7.8) can sometimes be controlled by the behavior of g on part of the boundary. 


Proposition 7.3. Let O Cc R? be a domain contained in the annulus 7 < |x| < ra, 
and let u € C?(O) N C(O) solve M(u) = 0. Set 


(7.17) G(x;r) =r cosh™' () _ for|z|>r, G(a;r) <0. 
If 
(7.18) ula) < G(a;r1) + Mon {2 € 00: |x| > ri}, 
for some M € R, then 
(7.19) u(x) < G(a;r1)+ Mon O. 
Here, z = G(x;1r1) defines the lower half of a catenoid, over {x € R? : 


|x| > 11}. This function solves the minimal surface equation on |x| > 11 and 
vanishes on |x| = 11. 


Proof. Given s € (11,72), let 


(7.20) e(s) = max |G(2; r1) — G(a; s)|. 


s<|e|<ra 
The hypothesis (7.18) implies that 

(7.21) u(x) < G(a;s) + M + e(s) 

on {a € OO: |x| > s}. We claim that (7.21) holds for x in 

(7.22) O(s) =ON {x: 8 < |x| < ro}. 

Once this is established, (7.19) follows by taking s \y 11. To prove this, it suffices 
by Proposition 7.2 to show that (7.21) holds on 0O(s). Since it holds on OO, it 


remains to show that (7.21) holds for x in 


(7.23) C(s) = ON {a: |x| = s}, 
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FIGURE 7.1 Nonconvex Region O 


FIGURE 7.2 Another Nonconvex Region O 


illustrated by a broken arc in Fig. 7.1. If not, then u(a) — G(a; 5) would have a 
maximum M, > M + é(s) at some point p € C(s). By Proposition 7.1, we have 
u(x) — G(a;s) < M; on O(s). However, Vu(x) is bounded on a neighborhood 
of p, while 


3) 
(7.24) Bp C(t s)=-oo on|a|=s. 
This implies that u(x) — G(a; 5) > My, for all points in O(s) sufficiently near p. 
This contradiction shows that (7.21) must hold on C(s), and the proposition is 
proved. 


One implication is that if O C R? is as illustrated in Fig. 7.1, it is not pos- 
sible to solve the boundary problem (7.8) with g prescribed arbitrarily on all of 
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dO. A more precise statement about domains O C R? for which (7.8) is always 
solvable is the following: 


Proposition 7.4. Let O C R? be a bounded, connected domain with smooth 
boundary. Then (7.8) has a solution for all g € C%(0O) if and only if O is 
convex. 


Proof. The positive result is given in Proposition 7.1. Now, if O is not convex, 
let p € OO bea point where O is concave, as illustrated in Fig. 7.2. Pick a disk D 
whose boundary C is tangent to 0O at p and such that, near p, C intersects the 
complement 0* only at p. Then apply Proposition 7.3 to the domain O = O \ D, 
taking the origin to be the center of D and r, to be the radius of D. We deduce 
that if u solves M(u) = 0 on O, then 


(7.25) uz) < M+ G(a;r1) ondO\ D = u(p) < M, 


which certainly restricts the class of functions g for which (7.8) can be solved. 


Note that the function v(x) = G(x; 1) defined by (7.17) also provides an exam- 
ple of a solution to the minimal surface equation (7.8) on an annular region 


O={2 ER? :r < |z2| < st}, 


with smooth (in fact, locally constant) boundary values 
-1 8 

v=Oon|z|=r, v= -—r cosh —on|a|=s, 
r 


which is not a smooth function, or even a Lipschitz function, on O. This is another 
phenomenon that is different when O is convex. We will establish the following: 


Proposition 7.5. If O C R? is a bounded region with smooth boundary which is 
strictly convex (i.e., OO has positive curvature), and g © C®° (OO) is real-valued, 
then the solution to (7.8) is Lipschitz at each point xo € OO. 


Proof. Given %) € OO, we have 2 = (xo, 9(a0)) € 7 C R®, where + is the 
boundary of the minimal surface M which is the graph of z = u(x). The strict 
convexity hypothesis on O implies that there are two planes I; in R® through 
zo, such that II, lies below y and II above y, and II; are given by z = aj: 
(w—29)+9(x0) = Wjxo (x), a; = a;(@0) € R®. There is an estimate of the form 


(7.26) |a;(xo)| S$ K(20)||9 © Prolloe: 


where p,,, is the radial projection (from the center of O) of OO onto a circle C(x) 
containing O and tangent to 0O at x9, and K (a9) depends on the curvature of 
C(ao). Now Proposition 7.2 applies to give 
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(7.27) Wao (x) < u(a) as W229 (2), rE 0,7 


since linear functions solve the minimal surface equation. This establishes the 
Lipschitz continuity, with the quantitative estimate 


(7.28) |u(rp) — u(x)| < Ala—azol, 2% € 00, rE O, 
where 
(7.29) A= sup_ |ai(zo)| + |a2(z0)|. 

x20 €0O 


This result points toward an estimate on |Vu(zx)|, x € ©, for a solution to 
(7.8). We begin the line of reasoning that leads to such an estimate, a line that 
applies to other situations. First, let’s rederive the minimal surface equation, as 
the stationary condition for 


(7.30) I(u) = / F(Vu(a)) da, 
oO 
where 
1/2 
(731) F(p) = (1+?) 


so (7.30) gives the area of the graph of z = u(x). The method used in Chapter 2, 
§ 1, yields the PDE 


(7.32) S¢ A (Vu) 0,0ju = 0, 

where 

(7.33) A‘) (p) = OF 
OpiOp; 


Compare this with (1.68) and (1.36) of Chap. 2. When F'(p) is given by (7.31), 
we have 


(7.34) A¥(p) = (p)~* (8:5 (P)? — pips), 
so in this case (7.32) is equal to —M(u), defined by (7.3). Now, when wu is a 
sufficiently smooth solution to (7.32), we can apply 0g = 0/0xz to this equation 


and obtain the PDE 


(7.35) S— 0;A% (Vu) djwe = 0, 
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for we = Ocu, not for all PDE of the form (7.32), but whenever A’/(p) is sym- 
metric in (i,j) and satisfies 


OAY oA 
(7.36) = 
ODm Op; 


which happens when A‘ (p) has the form (7.33). If (7.35) satisfies the ellipticity 
condition 


(7.37) S > A¥ (Vu(x)) &€; = C(a)lé?, C(x) > 0, 


for « € O, then we can apply the maximum principle, to obtain the following: 


Proposition 7.6. Assume u € C'(Q) is real-valued and satisfies the PDE (7.32), 
with coefficients given by (7.33). If the ellipticity condition (7.37) holds, then 
Ogu(x) assumes its maximum and minimum values on OO; hence 


(7.38) sup |Vu(a)| = sup |Vu(z)]. 
2EO xEd0O 


Combining this result with Proposition 7.5, we have the following: 


Proposition 7.7. Let O C R? be a bounded region with smooth boundary which 
is strictly convex, g © C® (OQ) real-valued. If u € C?(O) NC1(O) is a solution 
to (7.8), then there is an estimate 


(7.39) lull cag) < CO) Ilglic2(ao). 


Note that the existence result of Proposition 7.1 does not provide us with the 
knowledge that u belongs to C1(O), and thus it will take further work to demon- 
strate that the estimate (7.39) actually holds for an arbitrary real-valued solution 
to (7.8) when O C R? is strictly convex and g is smooth. We will be in a position 
to establish this result, and further regularity, after sufficient theory is developed 
in the next two sections. See in particular Theorem 10.4. For now, we can regard 
this as motivation to develop the tools in the following sections, on the regularity 
of solutions to elliptic boundary problems. 

We next look at the Gauss curvature of a minimal surface M/, given by z = 
u(x), x € O C R?. Fora general u, the curvature is given by 


_ 2) —-2 Oru 
(7.40) K = (1+ |Vul?)~* det (seo) 


See (4.29) in Appendix C. When u satisfies the minimal surface equation, there 
are some other formulas for K, in terms of operations on 


(7.41) B(x) = F(Vu)! = (1+ |Vul?)"?, 
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which we will list, leaving their verification as an exercise: 


= |\Vo|? 
(7.42) at eee of 
i 
(7.43) K = 55 9, 
(7.44) K =A log(1+®). 


Now if we alter the metric g induced on M via its imbedding in R? by a 
conformal factor: 


(7.45) g =(1+%)?g=ec?"g, v=log(14+9), 


then, as in formula (1.30), we see that the Gauss curvature k of M in the new 
metric is 


(7.46) k= (—Av+ K)e~”” =0; 


in other words, the metric g’ = (1 + ®)?g is flat! Using this observation, we can 
establish the following remarkable theorem of S. Bernstein: 


Theorem 7.8. [fu : R? > R is an everywhere-defined C?-solution to the mini- 
mal surface equation, then u is a linear function. 


Proof. Consider the minimal surface M given by z = u(x), x € R?, in the 
metric g’ = (1+ ©)?g, which, as we have seen, is flat. Now g’ > g, so this is 
a complete metric on M. Thus (M, g’) is isometrically equivalent to R?. Hence 
(M, g) is conformally equivalent to C. 

On the other hand, the antipodal Gauss map 


(7.47) N:M—S?, N=(Vu)71(Vu,-1), 


is holomorphic; see Exercise | of § 6. But the range of N is contained in the lower 
hemisphere of $7, so if we take S? = C U {oo} with the point at infinity identi- 
fied with the “north pole” (0, 0,1), we see that N yields a bounded holomorphic 
function on M = C. By Liouville’s theorem, N must be constant. Thus M is a 
flat plane in R®. 


It turns out that Bernstein’s theorem extends to u : R” — R, for n < 7, by 
work of E. DeGiorgi, F. Almgren, and J. Simons, but not to n > 8. 


Exercises 


1. If DM (uw) is the differential operator given by (7.14)-(7.15), show that its principal 
symbol satisfies 
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(7.48) —Op piu (@,€) = (1+ lel? )IEP? — @- &? > él’, 


where p = Vu/(z). 
2. Show that the formula (7.3) for M(f) is equivalent to 


(7.49) M(f) = » O;((Vf)* Of) = div((Vf)' VF). 


3. Give a detailed demonstration of the estimate (7.26) on the slope of planes that can lie 
above and below the graph of g over OO (assumed to have positive curvature), needed 
for the proof of Proposition 7.5. (Hint: In case OO is the unit circle S*, consider the 
cases g(@) = cos* 6.) 

4. Establish the formulas (7.42)—(7.44) for the Gauss curvature of a minimal surface. 


8. Elliptic regularity II (boundary estimates) 


We establish estimates and regularity for solutions to nonlinear elliptic bound- 
ary problems. We treat completely nonlinear, second-order equations, obtaining 
L?-Sobolev estimates for solutions assumed a priori to belong to O?+"(M), r > 
0. We make note of improved estimates for solutions to quasi-linear, second-order 
equations. In § 10 we will show how such results, when supplemented by the 
DeGiorgi—Nash—Moser theory, apply to the solvability of the Dirichlet problem 
for certain quasi-linear elliptic PDE. 

Though we restrict attention to second-order equations, the analysis in this 
section extends readily to higher-order elliptic systems, such as we treated in § 11 
of Chap. 5. The exposition here is taken from [T]. 

Having looked at interior regularity in §4, we restrict attention to a collar 
neighborhood of the boundary 0// = X, so we look at a PDE of the form 


(8.1) Oru = F(y,x, D?u, D5d,u), 
with y € [0,1], 2 © X. We set 
(8.2) vi = Au, v2 = Oyu, 


and produce a first-order system for v = (v1, v2), 


aes Avo, 

(8.3) — 
— = Fly, 2, D2A~'v1, Dyv2). 
Oy 


An operator like T = A or T = D2A~! does not map C*t+!*"(I x X) to 
Ck+r(I x X), but if we set 
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(8.4) Cera, err rere 
e>0 

then 

(8.5) TC (Ex X)— OFT eX), 


Thus we will assume u € C?*"*. This implies v € C!*"*, and the arguments 
D?A~+v, and D!v2 appearing in (8.3) belong to C"+. We will be able to drop 
the “+” in the statement of the main result. 

Now if we treat y as a parameter and apply the paradifferential operator con- 
struction developed in § 10 of Chap. 13 to the family of operators on functions of 
xz, we obtain 


F(y, x, D2A~ ‘vy, D3v2) = Ai(v;y, 2, De) 


8.6 
: y + Ag(u;y, 2, Dz)ve + R(v), 


with (for fixed y) R(v) € C™(X), 


(8.7) Aj(v;y,2,€) € ApSi1 C C’Sto N Six 
and 
(8.8) DEA, $1, for |B) <7, Sy t"-”, for |p| >, 


provided u € C2t'*, 
Note that if we write F = F(y,2,¢,7), Gy = Du (lal < 2), ne = 
D*dyu (ja| < 1), then we can set 


F 
(8.9) Biwy,26) = >- Fen (D3A 1, Dhaene 
lal<2 > 


(suppressing the y- and x-arguments of fF’) and 


aF 
(8.10) Bo(v;y,2,€) = D> -—(DZA7*01, Dy v2)é*. 
lal<1 - *™ 
Thus 
(8.11) vecitt —» A; — By EC™S{7". 


Using (8.4), we can rewrite the system (8.3) as 
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i Ava, 
(8.12) Bi 
a = A(x, D)v; + Ao(x, D)v2 + R(v). 


We also write this as 


(8.13) a = K(v;y,2,D,v+R (REC), 
y 


where K'(v; y, 7, Dx) is a 2 x 2 matrix of first-order pseudodifferential operators. 
Let us denote the symbol obtained by replacing A; by B; as K, so 


(8.14) K-KeC'sty’. 
The ellipticity condition can be expressed as 
(8.15) spec K(v;y,2,€) C {z EC: |Re z| > Clé|}, 


for |€| large. Hence we can make the same statement about the spectrum of the 
symbol K, for |€| large, provided v € C1t"* with r > 0. 

In order to derive L?-Sobolev estimates, we will construct a symmetrizer, in 
a fashion similar to § 11 in Chap.5. In particular, we will make use of Lemma 
11.4 of Chap. 5. Let F = E(v; y, x, €) denote the projection onto the {Re z > 0} 
spectral space of K, defined by 


(8.16) E(y, 2, €) = aa | — K(y,2,€)) dz, 


y 


where ¥ is a curve enclosing that part of the spectrum of K (y, x, €) contained in 
{Re z > 0}. Then the symbol 


(8.17) A=(2E-1)K €C’S), 


has spectrum in {Re z > 0}. (The symbol class C”.$” is defined as in (9.46) of 
Chap. 13.) Let P € C"S® be a symmetrizer for the symbol A, constructed via 
Lemma 11.4 of Chap. 5, namely, 


P(y,x,€) = ®(A(y, 2, €)), 


where ® is as in (11.54)-(11.55) in Chap. 5. Thus P and (PA+ A*P) are positive- 
definite symbols, for |€| > 1. 

We now want to apply symbol smoothing to P, A, and E. It will be convenient 
to modify the construction slightly, and smooth in both x and y. Thus we obtain 
various symbols in S7”';, with the understanding that the symbol classes reflect 
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estimates on D,, ,-derivatives. For example, we obtain (with 0 < 6 < 1) 
(8.18) P(y,2,€)€ S35; P—-Pecsys 


by smoothing P, in (y, 7). We set 


(8.19) Q= 5 (Plus, De) + Ply, De)*) + KA, 


with kX > 0 picked to make the operator Q positive-definite on L?(X). Similarly, 
define A and E by smoothing A and F in (y, x), so 
A(y,2,£) € Sts, A-AECTSIZ”, 


(8.20) 0 [- ra—ré 
Ety,z,€)€ S15, B-EBe C'S, ;°, 


and we smooth K, writing 
(8.21) K=Ko+K°; Ky€ Sis, K° EC STZ? NST”. 
Consequently, on the symbol level, 


ony A=(2E-1)Ko+A", A’ EST5", 
, PA+A*P>CIE|, for |&| large. 
Let us note that the homogeneous symbols K, E, and A commute, for each 


(y,x,€); hence the commutators of the various symbols kK, E, A have order 
<ré units less than the sum of the orders of these symbols; for example, 


(8.23) [E(y, x, €), Ko(y,2,£)] € $156. 


Using this symmetrizer construction, we will look for estimates for solutions 
to a system of the form (8.3) in the spaces Hy,,5(M/) = Hy,.(J x X), with norms 


k 
(8.24) llolls = >, AAP 0(y) Z2c<x): 
j=0 


We shall differentiate (QA*° Ev, A* Ev) and (QA*%(1 — E)v, A8(1 — E)v) with 
respect to y (these expressions being L?(X)-inner products) and sum the two 
resulting expressions, to obtain the desired a priori estimates, parallel to the 
treatment in § 11 of Chap. 5. 

Using (8.13), we have 
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 (QA*Bv, A*Bo) = 2 Re(QA°E(Kv + R), A° Ev) 
y 


(S22) + (Q'A*Bv, A’ Ev) 


+2 Re(QA*E'v, A* Ev). 


Note that given v € C!t"*,r > 0, Q! and E’ belong to OPS? 5. Hence, for 
fixed y, each of the last two terms is bounded by 


(8.26) Cllu(y) Il Frs+5/2- 


Here and below, we will adopt the convention that C = C(||v||c1+r+), with a 
slight abuse of notation. Namely, v € C'*"* belongs to C!*"t* for some € > 0, 
and we loosely use ||v||c1+r+ instead of ||v||c1+r+e. 

To analyze the first term on the right side of (8.25), we write 


(QAP E(Kvu + R), A° Ev) = (QA EKov, AP Ev) 
27) + (QA° Kv, AS Ev) 
+ (QA°ER, A’ Ev), 


where the last term is harmless and, for fixed y, 
(8.28) (QASEK?v, ASEv)| < Cllu(y)||2,64-78/25 


provided s + (1 — rd)/2 — (1-16) > —(1 — 4)r, that is, 


1 1 
(8.29) s> a r+ 37: 


in view of (8.21). 
Since E(y, x, €) is a projection, we have E(y, x, )? — E(y,a,€) € S[i° and 


Ey, 2,D)— Ely, 2, DP = Fy, D) ©OPS, 5. 


(8.30) 
o = min(rd,1— 0). 

Thus 

(8.31) QEK) =QAE+G; G(y) € OPS{;’. 


Consequently, we can write the first term on the right side of (8.27) as 
(8.32) (QAEA*v, A’ Ev) — (GA*v, AP Ev) + (Q[A®, EKolv, A° Ev). 


The last two terms in (8.32) are bounded (for each y) by 
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(8.33) Cllu(y)Ize+0-29/2- 


As for the contribution of the first term in (8.32) to the estimation of (8.25), we 
have, for each y, 


(8.34) (QAEA*v, A* Ev) = (QAA* Ev, A° Ev) + (QA[EF, A*}u, A*v), 
the last term estimable by (8.33), and 
(8.35)  2Re(QAA* Ev, A° Ev) > C1||Ev(y)||Frs+1/2 — C2||Ev(y)|lzz-, 


by (8.22) and Garding’s inequality. Keeping track of the various ingredients in the 
analysis of (8.25), we see that 


d 
Go ee > Cy||Ev(y) lipe+12 


— C2||v(y)[IFp-+a-n/2 — Call RW) Ilize, 


(8.36) 


where Cj = C;(||v||ci+r+) > 0. 
A similar analysis gives 


d 
gan ay Oh BWA ~ Be) 


< —Ci||(1 — E)v(y)|[Frossj2 + Callow) pera-o/e + CIRO) lize: 


Putting together these two estimates yields 


1 
5CilloDllier2 S CrllBo@)Iiier2 + Cull — E)e@)zor12 


(8.38)  < 5 (QA° Ev, A°E%) (QA*(1 — E)u, A*(1 — Ev) 


d 
dy 
+ C2||v(y)|[Fs+a—0)/2 + C3||R(y) lls - 


Now standard arguments allow us to replace H*+(—%)/? by H'*, with t << s. 
Then integration over y € [0,1] gives 


Cillllos41/2 S A° Lv) [ize + AP — B)v(0) I7Z2 


(8.39) 

+ Colello. + Csll Rllo,«- 
Recalling that 
(8.40) Nlollts = ATF ollZ 2c + APOyellZ2c0 


and using (8.13) to estimate 0,v, we have 
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(8.41) |lulli,s—1/2 S C|I|Eu() Ize. + ||. — E)v0) lire + llvlloe + WRMlo, 5], 


with C = C(|\v||c1+r+), provided that v € C1+"* with r > 0 and that s satisfies 
the lower bound (8.29). Let us note that 


Cy [I|A*(1 — E)o(1)IZ2 + IA Bo(O) IZ 
could have been included on the left side of (8.39), so we also have the estimate 


(8.42) ||(1 — E)v(1)||2;- + || Ev(0)||?;. < right side of (8.41). 

Having completed a first round of a priori estimates, we bring in a consid- 
eration of boundary conditions that might be imposed. Of course, the boundary 
conditions Fv(1) = fi,(1 — E)v(0) = fo are a possibility, but these are really 
a tool with which to analyze other, more naturally occurring boundary condi- 
tions. The “real” boundary conditions of interest include the Dirichlet condition 
on (8.1): 


(8.43) u0=fo, w=f, 

various sorts of (possibly nonlinear) conditions involving first-order derivatives: 

(8.44) Gj(u, Diu) = fj, aty=j (J = 0,1), 

and when (8.1) is itself a K x K system, other possibilities, which can 
be analyzed in the same spirit. Now if we write D'u = (u,0,u,0yu) = 


(A~'v,,0,A~'v1,v2), and use the paradifferential operator construction of 
Chap. 13, § 10, we can write (8.44) as 


(8.45) H,(v;x2,D)v = 9;, aty=j, 
where, given vy € C!*"t, 
(8.46) H,(uj2,€) € Ag S.C C'S) op S93. 


Of course, (8.43) can be written in the same form, with Hv = v1. 
Now the following is the natural regularity hypothesis to make on (8.45); 
namely, that we have an estimate of the form 


YO < CeO) + A = BoA) Fr. | 


(8.47) 7 


+ CD [IAs (0: 2, Dyw(A)lre + lo) [3-4]. 
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We then say the boundary condition is regular. If we combine this with (8.41) and 
(8.42), we obtain the following fundamental estimate: 


Proposition 8.1. If v satisfies the elliptic system (8.3), together with the bound- 
ary condition (8.45), assumed to be regular, then 


(8.48) llellis-1y2 S iby II9yllFr= + Welloe + WRl6,6| > 
J 


provided v © Ay 5-1/2 Cl" r > 0, and s satisfies (8.29). We can take t << s. 
In case (8.44) holds, we can replace ||q;|| rs by || f;||Hs, and in case the Dirichlet 
condition (8.43) holds and is regular, we can replace ||g;\| 1° by ||f;\|xs+1 in 
(8.48). 


Here, we have taken the opportunity to drop the “+” from C!*"*; to justify 
this, we need only shift r slightly. For the same reason, we can assume that, in 
(8.1), u € C?t", for some r > 0. In the rest of this section, we assume for 
simplicity that s — 1/2 € Zt U {0}. 

We can now easily obtain higher-order estimates, of the form 


849) oR aya SC[STllgilrorna + lO. + WRIR 1.4]; 
J 


for t << s — 1/2, by induction from 
lolle.s—1/2 = llvllg—1,s41/2 oF y0llZ—1,s—1/2 


plus substituting the right side of (8.3) for 0,,v. This follows from the existence of 
Moser-type estimates: 


|F'(-, +, w1, We) [le,s—1/2 


8.50 
OG (pualhes, feels) [evn ls—1/2 + Htealhea/2 
fork,k+s—1/2 >0.If s—1/2 € Zt U{O}, such an estimate can be established 
by methods used in § 3 of Chap. 13. 

We also obtain a corresponding regularity theorem, via inclusion of Friedrich 
mollifiers in the standard fashion. Thus replace A* by AS = A*J_ in (8.25) and 
repeat the analysis. One must keep in mind that K° must be applicable to v(y) for 
the analogue of (8.28) to work. Given (8.21), we need u(y) € H? witha > 1—r. 
However, v € C!*" already implies this. We thus have the following result. 


Theorem 8.2. Let v be a solution to the elliptic system (8.3), satisfying the bound- 
ary conditions (8.45), assumed to be regular. Assume 


(8.51) vec, r>0, 
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and 
(8.52) ge rh" x), 
with s — 1/2 € Z* U {0}. Then 
(8.53) v € Ay s—1jo(l x X). 

In particular, taking s = 1/2, and noting that 
(8.54) Hx,o(M) = H*(M), 
we can specialize this implication to 
(8.55) 9; € H*-V/7(X) => ve HAI x X), 


for k = 1,2,3,..., granted (8.51) (which makes the k = 1 case trivial). 

Note that, in (8.36)(8.38), one could replace the term || R(y)||7;. by the prod- 
uct || R(y)|] ps—1/2 - ||v(y)|| gs+1/2; then an absorption can be performed in (8.38), 
and hence in (8.39)-(8.41) we can substitute || l|5 ._ 1/9, and use || R\lz_ 4. 4/5 
in (8.49). 

We note that Theorem 8.2 is also valid for solutions to a nonhomogeneous 
elliptic system, where R in (8.13) can contain an extra term, belonging to 
Ay,-1,5—1/2, and then the estimate (8.49), strengthened as indicated above, and 
consequent regularity theorem are still valid. If (8.1) is generalized to 


(8.56) Ou = F(D2u, Dz Oyu) + f, 


then a term of the form (0, f)’ is added to (8.13). 

In view of the estimate (8.11) comparing the symbol of K with that obtained 
from the linearization of the original PDE (8.1), and the analogous result that 
holds for H;, derived from G';, we deduce the following: 


Proposition 8.3. Suppose that, at each point on OM, the linearization of the 
boundary condition of (8.44) is regular for the linearization of the PDE (8.1). 
Assume u € C+", r > 0. Then the regularity estimate (8.49) holds. In particu- 
lar, this holds for the Dirichlet problem, for any scalar (real) elliptic PDE of the 
form (8.1). 


We next establish a strengthened version of Theorem 8.2 when wu solves a 
quasi-linear, second-order elliptic PDE, with a regular boundary condition. Thus 
we are looking at the special case of (8.1) in which 
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F(y, 2, D2u, DZOyu) = — S- Bi (x,y, D'u) Oj;Oyu 
j 
(8.57) —S5 A (x,y, Diu) djOqu 
jk 
+ Fi(z,y, D'u). 


All the calculations done above apply, but some of the estimates are better. This 
is because when we derive the equation (8.13), namely, 


(8.58) 7 = K(v;y,2,Dz)u+R (REC) 


for v = (v1, v2) = (Au, Oyu), (8.7) is improved to 
(8.59) weCtrt —> K€ ASSi,+ S14" (r >0). 


Compare with (4.62). Under the hypothesis u € C!+"*+, one has the result (8.17), 
A€ C’S', which before required u € C?+"t. Also (8.20)-(8.22) now hold 
for u € C!t+"*+. Thus all the a priori estimates, down through (8.49), hold, with 
C = C(|lullci+r+). As before, we can delete the “+.” One point that must be 
taken into consideration is that, for the estimates to work, one needs u(y) € H? 
with o > 1 — r, and now this does not necessarily follow from the hypothesis 
u € C'*", Hence we have the following regularity result. Compare the interior 
regularity established in Theorem 4.5. 


Theorem 8.4. Let u satisfy a second-order, quasi-linear elliptic PDE with a 
regular boundary condition, of the form (8.45), for v = (Au, Oyu). Assume that 


(8.60) uECtTOH., r>0, rto>1. 
Then, fork = 0,1,2,..., 
(8.61) gj € H*-V/7(X) => ve HAT x X). 
The Dirichlet boundary condition is regular (if the PDE is real and scalar), and 
(8.62) u(j) = fj € H*t*(X) = ve A, -1(1 x X) 


if s > (1 —r)/2. In particular, 


u(j) = f; € fz ies 0. @ = ve H*(Ix X) 


(8.63) 
—> ue HEAT x X). 


We consider now the further special case 
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F(y, 2, D?u, DZOyu) = — > Bi (x,y, u) Oj;0yu 
(8.64) : 
_ » Al* (x,y, u) OjOpu + Fy(x,y, Dtu). 
j,k 


In this case, when we derive the system (8.58), we have the implication 


(8.65) we C™t(M) => K€ AGS}, + S17" (r>0). 


Similarly, under this hypothesis, we have A€Ors 1, and so forth. Therefore we 
have the following: 


Proposition 8.5. [f u satisfies the PDE (8.1) with F' given by (8.64), then the 
conclusions of Theorem 8.4 hold when the hypothesis (8.60) is weakened to 


(8.66) weEC’NH., r+o>l. 


Note that associated to this regularity is an estimate. For example, if wu satisfies 
the Dirichlet boundary condition, we have, for k > 2, 


(8.67) lull areca) S Ca (lel ercazy) [Itlomell xe-1/2(amry + llullzzcan], 


where we have used Poincaré’s inequality to replace the H,,,-norm of u by the 
L?-norm on the right. 

Let us see to what extent the results obtained here apply to solutions to the 
minimal surface equation produced in § 7. Recall the boundary problem (7.8): 


Ou Ou Ou _ 
Ox; Ox; Ox;Ox; 7 


(8.68) (Vu)?Au— 5° 0, u=gondd, 
ij 


where Q is a strictly convex region in R?, with smooth boundary. For this bound- 
ary problem, Theorem 8.4 applies, to yield the implication 

(8.69) g € H**1/2(90) = ue H**1(0), k=0,1,2,..., 
provided we know that 

(8.70) uEC"(O)N A e(A), r>0, rt+o>1, 

where A is a collar neighborhood of 00 in O. Now, while we know that solutions 
to the minimal surface equation are smooth inside O (having proved that minimal 


surfaces are real analytic), we so far have only continuity of a solution u on O, 
plus a Lipschitz bound on ul jo and a hope of obtaining a bound in C1(O). We 
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therefore have a gap to close to be able to apply the results of this section to 
solutions of (8.68). 

The material of the next two sections will close this gap. As we’ll see, we will 
be able to treat (8.68), not only for dim O = 2, but also for dim O = n > 2. Also, 
the gap will be closed on a number of other quasi-linear elliptic PDE. 


Exercises 


1. Suppose w is a solution to a quasi-linear elliptic PDE of the form 
y ajr (x, u)O;O,u + b(x,u, Vu) =0 on M, 
satisfying boundary conditions 
Bo(a,uju=go0, Bi(t,u,D)u=gi1, ondM, 


assumed to be regular. The operators B; have order j. Generalizing (8.67), show that, 
for any r > 0, k > 2, there is an estimate 


(8.71) 
Ilull are cary S Ce (llullor amy) (Ilgollx-1/2¢0n1) + ||91|la*-3/2(am) + lullacan)- 


2. Extend Theorem 8.4 to nonhomogeneous, quasi-linear equations, 
(8.72) S > ajx(x, D'u) 0;0,u + B(x, D'u) = h(a), 


satisfying regular boundary conditions. If one uses the Dirichlet boundary condition, 
ee = g, show that 


(8.73) lll] er® car) < Cr (lletll c+ cazy) (Ill irs—1/2¢0a1) + lll 2—2¢a0) + lull can) 


3. Give a proof of the mapping property (8.5). 
4. Prove the Moser-type estimate (8.50), when s — 1/2 = £ € Zt U {0}. (Hint. Rework 
Propositions 3.2-3.9 of Chap. 13, with H* replaced by Hx,2.) 


9. Elliptic regularity III (DeGiorgi-Nash—Moser theory) 


As noted at the end of § 8, there is a gap between conditions needed on the solution 
of boundary problems for many nonlinear elliptic PDEs, in order to obtain higher- 
order regularity, and conditions that solutions constructed by methods used so far 
in this chapter have been shown to satisfy. One method of closing this gap, that 
has proved useful in many cases, involves the study of second-order, scalar, linear 
elliptic PDE, in divergence form, whose coefficients have no regularity beyond 
being bounded and measurable. 
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In this section we establish regularity for a class of PDE Lu = f, for second- 
order operators of the form (using the summation convention) 


(9.1) Lu = b-'0;(a!*b Ogu), 


where (a/*(z)) is a positive-definite, bounded matrix and 0 < by < d(x) < by,b 
scalar, and aJ*,b are merely measurable. The breakthroughs on this were first 
achieved by DeGiorgi [DeG] and Nash [Na2]. We will present Moser’s derivation 
of interior bounds and Holder continuity of solutions to Lu = 0, from [Mo2], and 
then Morrey’s analysis of the nonhomogeneous equation Lu = f and proof of 
boundary regularity, from [Mor2]. Other proofs can be found in [GT] and [KS]. 

We make a few preliminary remarks on (9.1). We will use aJ* to define an 
inner product of vectors: 


(9.2) (V,W) = Vial" We, 
and use b dz = dV as the volume element. In case g;,(x) is a metric tensor, if 


one takes aJ* = gi* and b = g'/?, then (9.1) defines the Laplace operator. For a 
compactly supported function w, 


(9.3) (Lu, w) = — [(vuvw) dv. 


The behavior of L on a nonlinear function of u, v = f(u), plays an important 
role in estimates; we have 


(9.4) v= f(u) => Lv f'(u)Lut f"(u)|Vul?, 


where we set |V|? = (V,V). Also, taking w = w?u in (9.3) gives the following 
important identity. If Lu = g on an open set 2 and 7 € Cj (Q), then 


(9.5) [ivr dV = -2 f (WVuuv¥) a= [vs dV. 


Applying Cauchy’s inequality to the first term on the right yields the useful 
estimate 


it 
(9.6) 5 [evar dV < 2 f \uPiveP av — [ v%qu dV. 


Given these preliminaries, we are ready to present an approach to sup norm 
estimates known as “Moser iteration.” Once this is done (in Theorem 9.3 below), 
we will then tackle Hélder estimates. 

To implement Moser iteration, consider a nested sequence of open sets with 
smooth boundary 


(9.7) Qo Dias DO; > OF41 one 
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with intersection ©, as illustrated in Fig.9.1. We will make the geometrical 
hypothesisthat the distance of any point on 0Q;+1 to 0Q; is ~ Cj~?. We want 
to estimate the sup norm of a function v on © in terms of its L?-norm on Qo, 
assuming 

(9.8) v > Oisasubsolution of L (ie., Lu > 0). 

In view of (9.4), an example is 

(9.9) v=(1+u7)/?, Lu=0. 

We will obtain such an estimate in terms of the Sobolev constants 7(Q;) and C;, 
defined below. Ingredients for the analysis include the following two lemmas, the 
first being a standard Sobolev inequality. 

Lemma 9.1. For v € H'(Q;), K < n/(n— 2), 

(9.10) loca, S$ MAN [IVB@,) + leolZe@,)]- 


The next lemma follows from (9.6) if we take 7) = 1 on (0;41, tending roughly 
linearly to 0 on 0Q,;. 


Lemma 9.2. [fv > 0 is a subsolution of L, then, with Cj = C(Q;, 9541), 
(9.11) IVullz2aj41) < Gllullzz@,)- 
Under the geometrical conditions indicated above on (2;, we can assume 


(9.12) (04) <0, Cz < C7? +1). 


FIGURE 9.1 Setup for Moser Iteration 
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Putting together the two lemmas, we see that when v satisfies (9.8), 


lee 2Caj4x) S 144) [C*[o125 (0) + lola. 
< y(CF* + YllollZB(a,) 


(9.13) 


Fix « € (1,n/(n — 2)]. Now, if vu satisfies (9.8), so does 
(9.14) vj =v", 


by (9.4). Note that vj41 = Uj. Now let 


(9.15) Nj = Welles yy = luillza,9s 
SO 

(9.16) lvllz-~(o) < ee Nj. 

If we apply (9.13) to v;, we have 

(9.17) ley sallZ2copery S Yo(CF* + Dllosll 73 


201 


Note that the left side is equal to NV. 2n**" and the norm on the right is equal to 


i 
Nee, Thus (9.17) is equivalent to 


1/ni+t 
(9.18) N}ia < [o(CH +] NP. 
By (9.12), C?" + 1 < Co(j** + 1), so 
L/nitt 
lim sup N?< <T] Bree Am | No 


jroo jx0 


(9.19) 


j=0 
< K?No, 


for finite kK. This gives Moser’s sup-norm estimate: 
Theorem 9.3. [fv > 0 is a subsolution of L, then 


(9.20) llullze(o) S Kllullz2(ao) 


< (YoCo)/"- Jexp $7 eI log (74* +.1)| NG 
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where K = K(7,Co, 1). 


Holder continuity of a solution to Lu = 0 will be obtained as a consequence 
of the following “Harnack inequality.” Let B, = {a : |x| < p}. 


Proposition 9.4. Let u > 0 be a solution of Lu = 0 in Bo,. Pick co € (0,00). 
Suppose 


(9.21) meas{x € B,: u(x) >1}>cptr”. 
Then there is a constant c > 0 such that 
(9.22) u(t) >c7* in Byya. 
This will be established by examining v = f(w) with 
(9.23) f(u) = max{—log(u + €), 0}, 
where € is chosen in (0, 1). Note that f is convex, so v is a subsolution. Our first 
goal will be to estimate the L?(B,.)-norm of Vv. Once this is done, Theorem 9.3 
will be applied to estimate v from above (hence u from below) on B;,./2. 


We begin with a variant of (9.5), obtained by taking w = 7? f’(u) in (9.3). The 
identity (for smooth f) is 


(9.24) perivee dV + 2 f (ws'Vu, Vu) dV = —(Lu, wf’). 


This vanishes if Lu = 0. Applying Cauchy’s inequality to the second integral, we 
obtain 


25 fw [fw -e fw? ]IvuP av < = f [ver av. 
Now the function f (uw) in (9.23) has the property that 

(9.26) h = —e~/ is a convex function; 

indeed, in this case h(w) = max{—(u + ¢), —1}. Thus 

(9.27) f' —Of¥ seth” > 0. 


Thus f”(u)|Vul? > f’(u)?|Vul? = |Vol? if v = f(u). Taking 6? = 1/2 in 
(9.25), we obtain 


(9.28) [ever dV < af Vil? dV, 
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after one overcomes the minor problem that f’ has a jump discontinuity. If we 
pick w to = 1 on B,. and go linearly to 0 on 0B2,, we obtain the estimate 


(9.29) / |\Vul? dV < Cr®-?, 
B, 


for v = f(u), given that Lu = 0 and that (9.26) holds. 


Now the hypothesis (9.21) implies that v vanishes on a subset of B, of measure 


> Co ';". Hence there is an elementary estimate of the form 


(9.30) gor ; ve dV <Cr-” / |Vul? dV, 
B,. B, 


which is bounded from above by (9.29). Now Theorem 9.3, together with a simple 
scaling argument, gives 


(9.31) u(x)? < crn fv? dV < Ci, re Bj, 
By, 

sO 

(9.32) ute>e™, forz € Byjo, 


for all e € (0,1). Taking ¢ — 0, we have the proof of Proposition 9.4. 
We remark that Moser obtained a stronger Harnack inequality in [Mo3], by a 
more elaborate argument. In that work, the hypothesis (9.21) is weakened to 


(9.21a) sup u(a) > 1. 
B,. 


To deduce the Holder continuity of a solution to Lu = 0 given Proposition 9.4 
is fairly simple. Following [Mo2], who followed DeGiorgi, we have from (9.20) 
a bound 


(9.33) Ju(a)| << K 


on any compact subset O of Qo, given u € H'(Qo), Lu = 0. Fix xq € O, such 
that B,(xo) C O, and, for r < p, let 


(9.34) w(r) = sup u(a) — inf u(a), 
By B, 


where B,. = B,(xo). Clearly, w(p) < 2K. Adding a constant to u, we can assume 
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(9.35) sup u(#) = — inf u(x) = eat =M. 
Bp Bp 2 


Then u, = 1+ u/M and u_ = 1 — u/M are also annihilated by L. They are 
both > 0 and at least one of them satisfies the hypothesis (9.21), with r = p/2. If, 
for example, w+ does, then Proposition 9.4 implies 


(9.36) us(z)>e7* in Byya, 
sO 
(9.37) -M(1 = =) < ule) <M in Byya. 
Hence 
1 
(9.38) w(9/4) < (1- 5-)w(o), 


a 1 
(9.39) w(r) < w(p)(£) , «e — log, (1 = 5): 
c 
We state the result formally. 


Theorem 9.5. [fu € H+(Qo) solves Lu = 0, then for every compact O in Qo, 
there is an estimate 


(9.40) lull ca(o) < Cllull z2(Q¢)- 
It will be convenient to replace (9.40) by an estimate involving Morrey spaces, 


which are discussed in Appendix A at the end of this chapter. We claim that under 
the hypotheses of Theorem 9.5, 


(9.41) Vuln €M?, p= 


1-a’ 
where the Morrey space M? consists of functions f satisfying the q = 2 case of 
(A.2). The property (9.41) is stronger than (9.40), by Morrey’s lemma (Lemma 


A.1). To see (9.41), if Br is a ball of radius R centered at y, Bor C Q, then let 
c = u(y) and replace u by u(a) — c in (9.6), to get 


1 
5 f evuP av <2 f |u(z) - cP IVuP av. 


Taking 7 = 1 on Br, going linearly to 0 on OBgp, gives 
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(9.42) / iVule dv < CRO, 
Br 
as needed to have (9.41). 
So far we have dealt with the homogeneous equation, Lu = 0. We now turn to 
regularity for solutions to a nonhomogeneous equation. We will follow a method 


of Morrey, and Morrey spaces will play a very important role in this analysis. We 
take L as in (9.1), with a/ k measurable, satisfying 


(9.43) 0 < rolél? < So aF*(a)EjEx < ArlEl?, 

while for simplicity we assume b, b~! € Lip(Q). We consider a PDE 
(9.44) Lu = f. 

It is clear that, for u € Hj (Q), 

(9.45) (Lu,u) > CS~ |ldjullZ2, 

so we have an isomorphism 

(9.46) Dic BO}! a): 


Thus, for any f € H~1+(Q), (9.44) has a unique solution u € H4(Q). One can 
write such f as 


(9.47) f=) 56593, 9; € L7(Q). 
The solution u € Hj(() then satisfies 


(9.48) llullncay < CD Igillze- 


Here C' depends on 2, Ao, A1, and b € Lip({). 
One can also consider the boundary problem 


(9.49) Iv=0o0nQ, v=w ondQ, 
given w € H1'(Q), where the latter condition means v — w € Hj(Q). Indeed, 
setting v = u + w, the equation for u is Lu = —Lw, u € Hj (Q). Thus (9.49) is 


uniquely solvable, with an estimate 


(9.50) lVullz2a) < Cl|lVwllz2(a), 
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where C has a dependence as in (9.48). 
Our present goal is to give Morrey’s proof of the following local regularity 
result. 


Theorem 9.6. Suppose u € H'(Q) solves (9.44), with f = S>0j9;, 9; € 
M$(Q), q > n, that is, 


n—-2+2p n 
av < K2(4 me 
(9.51) Joi dv < Ki(>) , p=1-7€1) 
By, 


Assume L is of the form (9.1), where the coefficients aJ* satisfy (9.43) and b, b~! € 


Lip(Q). Let O CC Q, and assume pp < [49 = @, for which Theorem 9.5 holds. 
Then u € C(O); more precisely, Vu € M3(O), that is, 


n—2+2p 
2 < K2 nd 
(9.52) [ive dV < K? (}) 


Br 


Morrey established this by using (9.48), (9.50), and an elegant dilation argu- 
ment, in concert with Theorem 9.5. For this, suppose Br = Br(y) C © for each 
y € O. We can write u = U + H on Br, where 


5% LU = 5° jg; on Br, U € Hj(Br), 
LH=OonBr, H-—wué Hj(Br), 

and we have 

(9.54) |[VU||z2(Bp) S Cillgllzz(ee), VA lle2cBg) S< C2l|Vullz2(e,), 

where ||g||7.2 = >> ||9;||72- Let us set 


(9.55) Fil = FP llz2¢e,)- 


Also let &(g;, 2) be the best constant Ky for which (9.51) is valid forO <r < R. 
If g-(%) = g(r2), note that 


Kon? “S)=7" 74,8). 
Now define 


y(r) = sup{||VU||-s : U € Ho(Bs), LU = S95, on Bs, 


(9.56) 
n(gj,5) <1,0<S < R}. 


9. Elliptic regularity III (DeGiorgi-Nash—Moser theory) 207 


Let us denote by yg(r) the sup in (9.56) with S fixed, in (0, R]. Then ys(r) 
coincides with yr(r), with L replaced by the dilated operator, coming from the 
dilation taking By to Br. More precisely, the dilated operator is 


(9.57) Ls = bg 0; al* b5' Oe, 


with 
ais'(x) =a!*(SR-1z), bg(x) = b(SR-'2), 


assuming 0 has been arranged to be the center of Br. To see this, note that if 
7= S/R, U;(2) =7 “U0 (e2), and g5-(2) = 9;(72), then 


(9.58) LU = S° 59; => LgU, = D~ Ojgjr- 


Also, VU, (2) = (VU)(r2), so ||VU-||s/- = 7"/2||VU||s. 

Now for this family Ls, one has a uniform bound on C in (9.48); hence y(r) 
is finite for r € (0,1]. We also note that the bounds in (9.40) and (9.42) are 
uniformly valid for this family of operators. Theorem 9.6 will be proved when we 
show that 


(9.59) g(r) < Art/?-He, 


In fact, this will give the estimate (9.52) with u replaced by U; meanwhile such 
an estimate with u replaced by H is a consequence of (9.42). Let H satisfy (9.42) 
with a = Lg. We take ps < jug. 

Pick S' € (0, R] and pick g; satisfying (9.51), with R replaced by S and K, by 
K. Write the U of (9.53) as U = Us + Hg on Bg, where Us € H}(Bs), LUs = 
LU = 5° 0;9; on Bg. Clearly, (9.51) implies 


(9.60) / Ig? aV < «(sy a 


B, 
Thus, as in (9.54) (and recalling the definition of y), we have 


S\n/2-l+pu 
IVUslls < AiK(5) 


? 


(9.61) 9 
|VHslls < AalVUl|s < A2Ke(S). 


Now, suppose 0 <r < S < R. Then, applying (9.42) to Hs, we have 


|[VU | < ||VUs||p + VAslr 


Oe (G) EL) ane) 
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Therefore, setting s = r/R, t = S/R, we have the inequality 


(9.63) ols) < #2 MH¥9(2) + Asy(t) (ES), 


valid for 0 < s <t < 1. Since it is clear that (1) is monotone and finite on (0, 1], 
it is an elementary exercise to deduce from (9.63) that y(r) satisfies an estimate 
of the form (9.59), as long as pu < jug. This proves Theorem 9.6. 

Now that we have interior regularity estimates for the nonhomogeneous prob- 
lem, we will be able to use a few simple tricks to establish regularity up to the 
boundary for solutions to the Dirichlet problem 


(9.64) Lu= 5° 0;9;, u=f onagQ, 


where L has the form (9.1), © is compact with smooth boundary, f € Lip(0Q), 


and g; € L4(Q), with q > n. First, extend f to f € Lip(Q). Then u = v + f, 
where v solves 


(9.65) Lv = S- d;hj, v=0o0naQ, 
where 
(9.66) Ojh; = 0j9; — b~10;(a7*b Oxf). 


We will assume b € Lip(Q2); then A; can be chosen in L4 also. 

The class of equations (9.65) is invariant under smooth changes of variables 
(indeed, invariant under Lipschitz homeomorphisms with Lipschitz inverses, hav- 
ing the further property of preserving volume up to a factor in Lip(Q)). Thus make 
a change of variables to flatten out the boundary (locally), so we consider a solu- 
tion v € H' to (9.65) in x, > 0, |x| < R. We can even arrange that b = 1. Now 
extend v to negative x,,, to be odd under the reflection x, ++ —2,. Also extend 
a? (a) to be even when j, k < n or j = k = n, and odd when j or k = n (but not 
both). Extend h; to be odd for 7 < n and even for 7 = n. With these extensions, 
we continue to have (9.65) holding, this time in the ball || < R. Thus interior 
regularity applies to this extension of v, yielding Hélder continuity. The following 
is hence proved. 


Theorem 9.7. Let u € H'(Q) solve the PDE 
(9.67) $b 10; (a*b Ogu) = jg; onQ, w= f on dQ. 
Assume gj € L4(Q) with q > n =dimQ, and f € Lip(0Q). Assume that b, b~' € 


Lip(Q) and that (a1*) is measurable and satisfies the uniform ellipticity condition 
(9.43). Then u has a Holder estimate 
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(9.68) Iellou@y S C1 (~ Ilojllza(ay + I Flluincomy)- 


More precisely, if 4 = 1—n/q € (0,1) is sufficiently small, then Vu belongs to 
the Morrey space M4(Q), and 


(9.69) |Vullazzcay < C2 63 IlgsIlza(a) + Flluncon))- 
In these estimates, Cj = C;(Q, A1, A2, 6). 

So far in this section we have looked at differential operators of the form 
(9.1) in which (a?) is symmetric, but unlike the nondivergence case, where 
a* (x) 0;0,u = a*)(x) 0;O,u, nonsymmetric cases do arise; we will see an 
example in § 15. Thus we briefly describe the extension of the analysis of (9.1) to 


(9.70) Lu = b-' 0; ([a?* + w!*]b Ogu). 


We make the same hypotheses on a/* (a) and b(x) as before, and we assume («*) 
is antisymmetric and bounded: 


(9.71) wiF (2) = —w (a), wiF © £°(M). 


We thus have both a positive symmetric form and an antisymmetric form defined 
at almost all x € 2: 


(9.72) (V,W) = Vja)*(2)W,, [V, W] = Vjw?* (2) We. 
We use the subscript L? to indicate the integrated quantities: 
(9.73) (uv, Ww) 22 = fom dV, |v, w]r2 = feo dV. 
Then, in place of (9.3), we have 

(9.74) (Lu, w) = —(Vu, Vw) p2 — [Vu, Vu zz. 


The formula (9.4) remains valid, with |Vu|? = (Vu, Vu), as before. Instead of 
(9.5), we have 


(9.75) 7, w?|Vul? dV = —2(pVu, uVy) 22 —2[yVu, uV a] 22 — ; wgu dV, 
when Lu = g on Q and w € Cd (Q). This leads to a minor change in (9.6): 


0.7) 5 f vtivuP av < e+) [luPivePav— f wquav. 
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where Cy is determined by the operator norm of (w4"), relative to the inner prod- 
uct (, ). 

From here, the proofs of Lemmas 9.1 and 9.2, and that of Theorem 9.3, go 
through without essential change, so we have the sup-norm estimate (9.20). In the 
proof of the Harnack inequality, (9.24) is replaced by 


/ wf" |Wul? dV + 2b f'Vu, Ve) 2 + wb f'Vu, Vabz2 
= —(Lu, wf’). 


(9.77) 


Hence (9.25) still works if you replace the factor 1/6? by (1 + C,)/6?, where 
again C; is estimated by the size of (w/*). Thus Proposition 9.4 extends to our 
present case, and hence so does the key regularity result, Theorem 9.5. Let us 
record what has been noted so far: 


Proposition 9.8. Assume Lu has the form (9.70), where (a)*) and b satisfy the 
hypotheses of Theorem 9.5, and (w4") satisfies (9.71). If u € H*(Qo) solves 
Lu = 0, then, for every compact O C Qo, there is an estimate 


(9.78) Ilullca(o) < Cllullz2 ao): 


The Morrey space estimates go through as before, and the analysis of (9.64) is 
also easily modified to incorporate the change in L. Thus we have the following: 


Proposition 9.9. The boundary regularity of Theorem 9.7 extends to the opera- 
tors L of the form (9.70), under the hypothesis (9.71) on (w*). 


Exercises 


1. Given the strengthened form of the Harnack inequality, in which the hypothesis (9.21) is 
replaced by (9.21a), produce a shorter form of the argument in (9.33)-(9.40) for Hélder 
continuity of solutions to Lu = 0. 

2. Show that in the statement of Theorem 9.7, 5> 0; gj in (9.67) can be replaced by 


h+ > _ d59;, Gi € L7(Q), he L*(Q), q>n, p> - 


(Hint: Write h = 5° O;h; for some hj € L4(Q).) 
3. With L given by (9.1), consider 


Ly =L+X, X=) ° A;j(a) dj. 
Show that in place of (9.4) and (9.6), we have 
v= f(u) => Liv = f'(u)Liut f"(u)|Vul? 


and 
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sf viver dv < [ (aver i: 2Ay?) |ul? dV — [ udan) dV, 
where A(x)? = S> Aj(z)?. 


Extend the sup-norm estimate of Theorem 9.3 to this case, given Aj € L°°(Q). 
4. With L given by (9.1), suppose wu solves 


Lut+ S > 0; (Aj (x)u) +C(x2)u=g onQ ER”. 
Supppose we have 
Ay € L1(0), CE L7(9), gE LP(Q), P>E, a>n, 
and suppose we also have 
l[eull x72) + llullc-ocay < K, uly = f € Lip(OQ). 


Show that, for some p > 0, wu € C#(Q). (Hint: Apply Theorem 9.7, together with 
Exercise 2.) 


10. The Dirichlet problem for quasi-linear elliptic equations 


The primary goal in this section is to establish the existence of smooth solutions 
to the Dirichlet problem for a quasi-linear elliptic PDE of the form 


(10.1) S > Fojn.(Vu)djxu = 00nQ, u=yondn. 


More general equations will also be considered. As noted in (7.32), this is the 
PDE satisfied by a critical point of the function 


(10.2) I(u) = | F(Vu) dx 
| 


defined on the space 
1 1/9) - 4, — 
Vo = {ue A (Q):u=y on OO}. 
Assume y € C®(Q). We assume F is smooth and satisfies 


(10.3) Ai(p)IEl? < S° Fospn (PEER < Aa(P)IEl?, 


with A; : R" — (0,00), continuous. 
We use the method of continuity, showing that, for each 7 € [0,1], there is a 
smooth solution to 
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(10.4) 6,(D?u) =0onQ, u=y,ondQ, 
where ©,(D?u) = ©(D?u) is the left side of (10.1) and y, = y. We arrange a 
situation where (10.4) is clearly solvable for 7 = 0. For example, we might take 
yr = y and 
(10.5) ©,(D?u) = r®(D?u) + (L—T)Au= S— AI (Vu) 0;Oqu, 
with 
. 1 

(10.6) At*(p) = 9p, 9p, [TF (p) + 5(1 —7)IpP I. 
Another possibility is to take 
(10.7) @,(D?u) = 6(D?u),  ,(x) = TYy(2), 
since at T = O we have the solution u = 0 in this case. 

Let J be the largest interval containing {0} such that (10.7) has a solution 
u =u, € C*(Q) for each r € J. We will show that J is all of [0, 1] by showing 
it is both open and closed in [0,1]. We will deal specifically with the method 
(10.5)-(10.6), but a similar argument can be applied to the method (10.7). 

Demonstrating the openness of J is the relatively easy part. 
Lemma 10.1. Jf 70 € J, then, for some € > 0, [T0,T) +¢€) C J. 
Proof. Fix k large and define 
(10.8) W : [0,1] x VE — HF-7(Q) 
by W(7,u) = ®,(D?u), where 
(10.9) VE = {ue H*(Q) : u= yon OO}. 


This map is C', and its derivative with respect to the second argument is 


(10.10) D2V(7,u)u = Ly, 

where 

(10.11) L:Vf = H* noi at (0) 
is given by 


(10.12) Lu = 5) Oj; AI (Vu(a)) Opv. 
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L is an elliptic operator with coefficients in C°(Q) when u = u,,, clearly an 
isomorphism in (10.11). Thus, by the inverse function theorem, for 7 close enough 
to To, there will be u,, close to u,,, such that U(7,u,) = 0. Since u, € H*(Q) 
solves the regular elliptic boundary problem (10.4), if we pick & large enough, we 
can apply the regularity result of Theorem 8.4 to deduce u, € C™(Q). 


The next task is to show that J is closed. This will follow from a sufficient a 
priori bound on solutions u = u,, T € J. We start with fairly weak bounds. First, 
the maximum principle implies 


(10.13) lull to ¢azy = |l¢llz-aan), 


for eachu = u,, TE J. 
Next we estimate derivatives. Each we = Ogu satisfies 


(10.14) S| 0; A7*(Vu)Opwe = 0, 


where AJ* (Vu) is given by (10.6); we drop the subscript rT. 
The next ingredient is a “boundary gradient estimate,” of the form 


(10.15) \Vu(x)| < K, for x € oQ, 


As we have seen in the discussion of the minimal surface equation in § 7, whether 
this holds depends on the nature of the PDE and the region /. For now, we will 
make (10.15) a hypothesis. Then the maximum principle applied to (10.14) yields 
a uniform bound 


(10.16) \|Vull ta) <K. 

For the next step of the argument, we will suppose for simplicity that Q = 
T”—! x [0,1], for the present, and discuss the modification of the argument for 
the general case later. Under this assumption, in addition to (10.14), we also have 
(10.17) we = Opp on OD, forl << l<n—-1, 
since Of is tangent to 00 forl << n-—1. 

Now we can say that Theorem 9.7 applies to ug = Ogu, forl < €<n-—1. 
Thus there is an r > O for which we have bounds 
(10.18) lwellon@ SK, 1<e<n-1. 


Let us note that Theorem 9.7 yields the bounds 


(10.19) Vwelluecay <K’, 1<e<n-1, 
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which are more precise than (10.18); here 1 — r = n/p. Away from the bound- 
ary, such a property on all first derivatives of a solution to (10.1) leads to the 
applicability of Schauder estimates to establish interior regularity. 

In the case of examining regularity at the boundary, more work is required since 
(10.18) does not include a derivative 0, transverse to the boundary. Now, using 
(10.4), we can solve for 02.u in terms of 0;0,u, forl<jcn, l<k<n-1. 
This will lead to the estimate 


(10.20) lullornn@ay SK, 


as we will now show. 
In order to prove (10.20), note that, by (10.19), 


(10.21) O,Oeu € MP(Q), forl<£<n-1,1<k<n, 
where p € (n,oo) and r € (0,1) are related by 1 — r = n/p. Now the PDE 


(10.4) enables us to write O2u as a linear combination of the terms in (10.21), 
with L°°(Q)-coefficients. Hence 


(10.22) Ou € M3 (Q), 
NiO) 
(10.23) V(Onu) € MP(Q) c MP(Q). 


Morrey’s lemma (Lemma A.1) states that 


(10.24) Vv e MQ) vec") ifr=1- - E (0,1). 
Thus 
(10.25) dnu € C7 (M), 


and this together with (10.18) yields (10.20). From this, plus the Morrey space 
inclusions (10.21)-(10.22), we have the hypothesis (8.60) of Theorem 8.4, with 
r > 0 ando = 1. Thus, by Theorem 8.4, and the associated estimate (8.73), we 
deduce estimates 


(10.26) lull ea) S Ke, 
fork = 2,3,.... Therefore, if [0,7) C J,as tT, 7 71, we can pick a subsequence 


of u,,, converging weakly in H*+!(Q), hence strongly in H*(Q). If k is picked 
large enough, the limit wu; is an element of H*+1(Q), solving (10.4) for 7 = 
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7, and furthermore the regularity result Theorem 8.4 is applicable; hence uy € 
C™(Q). This implies that J is closed. 

Hence we have a proof of the solvability of the boundary problem (10.1), for 
the special case Q = T”~! x [0, 1], granted the validity of the boundary gradient 
estimate (10.15). 

As noted, to have 0,1 < £ < n—1, tangent to 0M, we required Q = 
T”-! x [0,1]. For Q Cc R”, if X = >> bed is a smooth vector field tangent to 
OQ, then ux = Xu solves, in place of (10.14), 


(10.27) S "0; A7*(Vu) deux = > O;F;, 


with F; € L® calculable in terms of Vu. Thus Theorem 9.7 still applies, and the 
rest of the argument above extends easily. We have the following result. 


Theorem 10.2. Let F : R” — R be a smooth function satisfying (10.3). Let 
Q Cc R” be a bounded domain with smooth boundary. Let p € C™°(0Q). Then 
the Dirichlet problem (10.1) has a unique solution u € C%(Q), provided the 
boundary gradient estimate (10.15) is valid for all solutions u = u, to (10.4), for 
T € [0,1]. 


Proof. Existence follows from the fact that J is open and closed in [0, 1], and 


nonempty, as 0 € J. Uniqueness follows from the maximum principle argument 
used to establish Proposition 7.2. 


Let us record a result that implies uniqueness. 


Proposition 10.3. Let (2 be any bounded domain in R". Assume that uy € 
C%(Q) AN C(Q) are real-valued solutions to 


(10.28) G(Vu,,07u,) =OonQ, uy, = g, on OQ, 


forv = 1,2, where G = G(p,¢), ¢ = (Cjx). Then, under the ellipticity hypothesis 


OG 

(10.29) S > a— (0,6) Gx = A(p)IEP? > 0, 
OGjk 

we have 

(10.30) gu <g2 on 02 = wy < ug on. 


Proof. Same as Proposition 7.2. As shown there, v = u2—u satisfies the identity 
Lv = G(Vuz,07u2) — G(Vuyz,07uz), and L satisfies the conditions for the 
maximum principle, in the form of Proposition 2.1 of Chap. 5, given (10.29). 


It is also useful to note that we can replace the first part of (10.28) by 
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(10.31) G(Vuz2, 0?u2) < G(Vur, 07u1), 


and the maximum principle still yields the conclusion (10.30). 

Since the boundary gradient estimate was verified in Proposition 7.5 for the 
minimal surface equation whenever 2 C R? has strictly convex boundary, we 
have existence of smooth solutions in that case. In fact, the proof of Proposition 
7.5 works when Q C R” is strictly convex, so that OQ has positive Gauss 
curvature everywhere. We hence have the following result. 


Theorem 10.4. Jf Q C R” is a bounded domain with smooth boundary that is 
strictly convex, then the Dirichlet problem 


Ou Ou 0?u 
(10.32) Au) be, On, 02j0% =0, w= gondQ, 


for a minimal hypersurface, has a unique solution u € C%*(Q), given 


g € C™(A). 


In Proposition 7.1, it was shown that when n = 2, the equation (10.32) has a 
solution u € C%°(Q) A C(Q), and Proposition 7.2 showed that such a solution 
must be unique. Hence in the case n = 2, Theorem 10.4 implies the regularity at 
OQ. for this solution, given yp € C™ (OQ). 

We now look at other cases where the boundary gradient estimate can be ver- 
ified, by extending the argument used in Proposition 7.5. Some terminology is 
useful. Let us be given a nonlinear operator F'(D?u), and g € C°°(0Q). We say 
a function By € C?(Q) is an upper barrier at y € OQ (for g), provided 


F(D?B,)<00o9, By, e€c'(9), 


(10.33) 
By >gondQ, Bi(y) =g(y). 


Similarly, we say B_ € C?(Q) is a lower barrier at y (for g), provided 


F(D?B_)>0o9, B_€c'(Q), 
(10.34) 
B_<gondQ, B_(y) = gly). 
An alternative expression is that g has an upper (or lower) barrier at y. Note well 
the requirement that B. belong to C'(Q). We say g has upper (resp., lower) bar- 
riers along OC if there are upper (resp., lower) barriers for g at each y € OQ, with 
uniformly bounded C!(Q)-norms. The following result parallels Proposition 7.5. 


Proposition 10.5. Let Q C R” be a bounded region with smooth boundary. 
Consider a nonlinear differential operator of the form F(D?u) = G(Vu, 07u), 
satisfying the ellipticity hypothesis (10.29). Assume that g has upper and lower 
barriers along OQ, whose gradients are everywhere bounded by K. Then a 
solution u € C?(Q) N C(Q) to F(D?u) = 0, u = g on OQ, satisfies 
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(10.35) |u(y) — u(x)|<2K|y—a2], yea, ce. 
Ifu € C7(Q) A. C1(Q), then 
(10.36) |Vu(z)|<2k, ren. 


Proof. Same as Proposition 7.5. If Bi, are the barriers for g at y € OQ, then 


B_y(x) <u(z) < Byy(2), ee, 


which readily yields (10.35). Note that we = Ogu satisfies the PDE 
OG OG 
10.37 —— 0,0 — Oj;we =0 Q 
( ) S- OCjk J k We ae SS Op; 7 We on sc, 


so the maximum principle yields (10.36). 


Now, behind the specific implementation of Proposition 7.5 is the fact that 
when OQ is strictly convex and g € C™°(0Q), there are linear functions B,,, 
satisfying B_y < g < Byyon dQ, B_,y(y) = g(y) = B+y(y), with bounded 
gradients. Such functions By, are annihilated by operators of the form (10.1). 
Therefore, we have the following extension of Theorem 10.4. 


Theorem 10.6. Jf 2 C R” is a bounded domain with smooth boundary that 
is strictly convex, then the Dirichlet problem (10.1) has a unique solution wu € 
C™(Q), given yp € C® (AQ), provided the ellipticity hypothesis (10.3) holds. 


We next consider the construction of upper and lower barriers when F'(D?u) = 
>> AJ* (Vu) 0;0,u satisfies the uniform ellipticity condition 


(10.38) AolEl? < So AM (Eee < ALl€l?, 


for some A; € (0, 00), independent of p. Given z € R”, R = |y—z|,a € (0,00), 
set 


(10.39) Ey2(&) = eer g OR? 2 ln—2l?. 


A calculation, used already in the derivation of maximum principles in §2 of 
Chap. 5, gives 


Al™(p) 0;04Ey,z(2) 
(10.40) 
=e [40?AI* (p)(wy — 24) (te — ze) — 20.4 j(p)] 


Under the hypothesis (10.38), we have 
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(10.41) S| AP (p) Oj Ox-By,2(2) > 2ae~°"" [2aro|a — 2|? — rrr] - 


To make use of these functions, we proceed as follows. Given y € OQ, pick 
z = 2(y) € R” \ Q such that y is the closest point to z on 2. Given that 12 is 
compact and 02 is smooth, we can arrange that |y — z| = R, a positive constant, 
with the property that R~! is greater than twice the absolute value of any principal 
curvature of O() at any point. Note that, for any choice of a > 0, Ey,.(y) = 0 and 
Ey, .(x) < 0 for x € 2 \ {y}. From (10.41) we see that if a is picked sufficiently 
large (namely, a > nA, /2R?Ao), then 


(10.42) S| A?*(p) 0j0xBy,2(t) >0, 2 ED, 


for all p, since |2 — z| > R. Now, given g © C®(0Q), we can find K € (0,00) 
such that, for all x € OQ, 


(10.43) Bay(&) = o(y) F KE,,. (x) => B_, (a) < g(x) < Byy(z). 


Consequently, we have upper and lower barriers for g along 02. Therefore, we 
have the following existence theorem. 


Theorem 10.7. Let Q C R” be any bounded region with smooth boundary. If the 
PDE (10.1) is uniformly elliptic, then (10.1) has a unique solution u € C%°(Q) 
for any p € C™®(AQ). 


Certainly the equation (10.32) for minimal hypersurfaces is not uniformly 
elliptic. Here is an example of a uniformly elliptic equation. Take 


2 
(10.44) = F(p) = (v1 +p? - a) = |p|? — 2a,/1 + [p? +14 02, 


with a € (0,1). This models the potential energy of a stretched membrane, say 
a surface S C R®, given by z = u(x), with the property that each point in 9 is 
constrained to move parallel to the z-axis. Compare with (1.5) in Chap. 2. 

It is also natural to look at the variational equation for a stretched membrane 
for which gravity also contributes to the potential energy. Thus we replace F'(p) 
in (10.44) by 


(10.45) F*(u,p) = F(p) + au, 


where a is a positive constant. This is of a form not encompassed by the class 
considered so far in this section. The PDE for wu in this case has the form 


(10.46) div F*(u, Vu) — FF (u, Vu) =0, 


which, when F’#(u, p) has the form (10.45), becomes 
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(10.47) SS Fojp, (Vu) 0j0,u — a = 0. 


We want to extend the existence argument to this case, to produce a solution u € 
C*(Q), with given boundary data yp € C%°(0Q). Using the continuity method, 
we need estimates parallel to (10.13)-(10.20). Now, since a > 0, the maximum 
principle implies 


(10.48) sup u(z) = sup (yy). 
rE yea 


To estimate ||u|| 2, we also need control of infg u(x). Such an estimate will 
follow if we obtain an estimate on ||Vu||;~©(q). To get this, note that the equation 
(10.14) for we = Oeu continues to hold. Again the maximum principle applies, so 
the boundary gradient estimate (10.15) continues to imply (10.16). Furthermore, 
the construction of upper and lower barriers in (10.39)—(10.43) is easily extended, 
so one has such a boundary gradient estimate. 

Now one needs to apply the DeGiorgi-Nash—Moser theory. Since (10.14) con- 
tinues to hold, this application goes through without change, to yield (10.20), and 
the argument producing (10.26) also goes through as before. Thus Theorem 10.7 
extends to PDE of the form (10.47). 

One might consider more general force fields, replacing the potential energy 
function (10.45) by 


(10.49) F#(u,p) = F(p) + V(u). 

Then the PDE for u becomes 

(10.50) S > Fojp,(Vu)djOnu — V'(u) = 0. 

In this case, we = Ocu satisfies 

(10.51) S/d; A7*(Vu)Opwe — V" (u)we = 0. 

This time, we won’t start with an estimate on ||w||_-©, but we will aim directly for 
an estimate on ||Vw|| 2°, which will serve to bound ||u||,-., given that u = y on 
On. 


The maximum principle applies to (10.51), to yield 


(10.52) |Vullz-~(a) = sup |Vu(y)|, provided V’(u) > 0. 
yea 


Next, we check whether the barrier construction (10.39)-(10.43) yields a bound- 
ary gradient estimate in this case. Having (10.43) (with g = ~), we want 


(10.53) H(D*B,,) <H(D*u) < H(D* B_,) 00 Q, 
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in place of (10.42), where H(D?u) is given by the left side of (10.50), and we 
want this sequence of inequalities together with (10.43) to yield 


(10.54) B_,(z) <u(#) < By,(z), we. 


To obtain (10.53), note that we can arrange the left side of (10.42) to exceed a 
large constant, and also a large multiple of E,,,(a). Note that the middle quentity 
in (10.53) is zero, so we want H(D?B,,) < 0 and H(D?B_,) > 0, on Q. We 
can certainly achieve this under the hypothesis that there is an estimate 


(10.55) |V’(u)| < Ai + Agu. 


In such a case, we have (10.53). To get (10.54) from this, we use the following 
extension of Proposition 10.3. 


Proposition 10.8. Let 2 C R” be bounded. Consider a nonlinear differential 
operator of the form 


(10.56) H (a, D?u) = G(2, u, Vu, 07), 

where G(x, u, p, ¢) satisfies the ellipticity hypothesis (10.29), and 

(10.57) OuG(a,u,p,¢) <0. 

Then, given u, € C?(Q) NC(Q), 

(10.58) H(D?uz) < H(D?u1) onQ, uy < ug on OD = uy < ug on O. 


Proof. Same as Proposition 10.3. For the relevant maximum principle, replace 
Proposition 2.1 of Chap. 5 by Proposition 2.6 of that chapter. 


To continue our analysis of the PDE (10.50), Proposition 10.8 applies to give 
(10.53) = (10.54), provided V’(u) > 0. Consequently, we achieve a bound on 
|| Vu] pq), and hence also on ||u|| ;-0(@), provided V (1) satisfies the hypotheses 
stated in (10.52) and (10.55). 

It remains to apply the DeGiorgi-Nash—Moser theory. In the simplified case 
where Q = T"~! x [0,1], we obtain (10.18), this time by regarding (10.51) 
as a nonhomogeneous PDE for wy, of the form (9.67), with one term 0; 95> 
namely 0:V’(u). The L°-estimate we have on u is more than enough to apply 
Theorem 9.7, so we again have (10.18)-(10.19). Next, the argument (10.21)— 
(10.23) goes through, so we again have (10.20) and the Morrey space inclusions 
(10.21)-(10.22). Hence the hypothesis (8.60) of Theorem 8.4 holds, with r > 0 
and o = 1. Theorem 8.4 yields 


(10.59) llull rea) < Ke, 
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and a modification of the argument parallel to the use of (10.27) works for 
Qc R”. 
The estimates above work for 


(10.60) > Fy jp, (VujdjO,u — TV'(u) + (1 —7)Au =0, tl =Y, 


for all 7 € [0, 1]. Also, each linearized operator is seen to be invertible, provided 
V"(u) > 0. Thus all the ingredients needed to use the method of continuity are 
in place. We have the following existence result. 


Proposition 10.9. Let Q C R” be any bounded domain with smooth boundary. If 
the PDE 


(10.61) SOF pipe V U)Oj;O,U — V'(u)=0, u=yonaQ, 
is uniformly elliptic, and if V'(u) satisfies 

(10.62) |V"(u)| < Art Aalul, V"(u) > 0, 

then (10.61) has a unique solution u € C®(Q), given p € C%(0Q). 


Consider the case V(u) = Au?. This satisfies (10.62) if A > 0 but not if 
A <0. The case A < 0 corresponds to a repulsive force (away from u = 0) 
that increases linearly with distance. The physical basis for the failure of (10.61) 
to have a solution is that if u(a) takes a large enough value, the repulsive force 
due to the potential V cannot be matched by the elastic force of the membrane. If 
Fy, p, (p) is independent of p and 2A < 0 is an eigenvalue of the linear operator 

Fy; p;,9; Or, then certainly (10.61) is not solvable. 

On the other hand, if V(u) = Au? withO > A > —o, where fo is less 
than the smallest eigenvalue of all operators )> A?* 0,0, with coefficients satis- 
fying (10.38), then one can still hope to establish solvability for (10.61), in the 
uniformly elliptic case. We will not pursue the details on such existence results. 

We now consider more general equations, of the form 


(10.63) H(D?u) =S°R pipe VU) O;O,u + g(a,u, Vu) =0, u an = ¥- 
Consider the family 
(10.64) H,(D?u) = S— Fy,p, (Vu) jOnut+79(2,u,Vu) =0, ula = 79. 


We will prove the following: 


Proposition 10.10. Assume that the equation (10.63) satisfies the ellipticity con- 
dition (10.3) and that 0,g(x,u,p) < 0. Let Q C R” be a bounded domain with 
smooth boundary, and let p € C®° (OQ) be given. Assume that, for T € [0,1], any 
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solution u = u, to (10.64) has an a priori bound in C1(Q). Then (10.63) has a 
solution u € C™®(Q). 


Proof. For wz = Ogu, we have, in place of (10.14), 
(10.65) S "0; A7* (Vu) Owe = —Oeg(x,u, Vu). 


The C!-bound on u yields an L®-bound on g(x, u, Vu), so, as in the proof of 
Proposition 10.9, we can use Theorem 9.7 and proceed from there to obtain high- 
order Sobolev estimates on solutions to (10.64). 

Thus the largest interval J in [0,1] that contains r = 0 and such that (10.64) 
is solvable for all 7 € J is closed. The hypothesis 0g < 0 implies that the 
linearized equation at 7 = 79 is uniquely solvable, so, as in Lemma 10.1, J is 
open in [0, 1], and the proposition is proved. 


A simple example of (10.63) is the equation for a surface z = u(x) of given 
constant mean curvature 7: 


(10.66) (Vu)~3 [(vu)2Au ~ D?u(Vu, Vu)| 4+nH=0, u=yonaQ, 


which is of the form (10.63), with F(p) = (1 + |p|?)'/? and g(a, u,p) = nH. 
Note that members of the family (10.64) are all of the same type in this case, 
namely equations for surfaces with mean curvature 7H. We see that Proposition 
10.3 applies to this equation. This implies uniqueness of solutions to (10.66), 
provided they exist, and also gives a tool to estimate L°°-norms, at least in some 
cases, by using equations of graphs of spheres of radius 1/H as candidates to 
bound u from above and below. We can also use such functions to construct bar- 
riers, replacing the linear functions used in the proof of Proposition 7.5. This 
change means that the class of domains and boundary data for which upper and 
lower barriers can be constructed is different when H ¥ 0 than it is in the minimal 
surface case H = 0. 

Note that if u solves (10.66), then wy = Ogu solves a PDE of the form (10.14). 
Thus the maximum principle yields ||Vul|z<(q) = supag |Vu(y)|. Conse- 
quently, we have the solvability of (10.66) whenever we can construct barriers 
to prove the boundary gradient estimate. 

The methods for constructing barriers described above do not exhaust the 
results one can obtain on boundary gradient estimates, which have been pushed 
quite far. We mention a result of H. Jenkins and J. Serrin. They have shown that the 
Dirichlet problem (10.66) for surfaces of constant mean curvature H is solvable 
for arbitrary p € C™(0Q) if and only if the mean curvature x(y) of OQ Cc R” 
satisfies 


(10.67) x(y) > lH, Vy € On. 
ce 
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In the special case n = 2, H = 0, this implies Proposition 7.3 in this chapter. 
See [GT] and [Se2] for proofs of this and extensions, including variable mean 
curvature H (x), as well as extensive general discussions of boundary gradient 
estimates. We will have a little more practice constructing barriers and deducing 
boundary gradient estimates in §$ 13 and 15 of this chapter. See the proofs of 
Lemma 13.12 and of the estimate (15.54). 

Results discussed above extend to more general second-order, scalar, 
quasi-linear PDE. In particular, Proposition 10.10 can be extended to all equations 
of the form 


(10.68) S- ajn(a,u, Vu) Oj0,u + W(a,u, Vu) =0, Ulgg =¥. 


Let yp € C(O) be given. As long as it can be shown that, for each 7 € [0, 1], a 
solution to 


(10.69) S- ajn(Z, u, Vu) O;O,u + 7Tb(a, u, Vu) = 0, tl ae =TY, 


has an a priori bound in C1(Q), then (10.68) has a solution u € C%(Q). This 
result, due to O. Ladyzhenskaya and N. Ural’tseva, is proved in [GT] and [LU]. 
These references, as well as [Se2], also discuss conditions under which one can 
establish a boundary gradient estimate for solutions to such PDE, and when one 
can pass from that to a C!(()-estimate on solutions. The DeGiorgi-Nash—Moser 
estimates are still a major analytical tool in the proof of this general result, but 
further work is required beyond what was used to prove Proposition 10.10. 


Exercises 


1. Carry out the construction of barriers for the equation of a surface of constant mean cur- 
vature mentioned below (10.66) and thus obtain some existence results for this equation. 
Compare these results with the result of Jenkins and Serrin, stated in (10.67). 


Exercises 2—4 deal with quasi-linear elliptic equations of the form 
(10.70) De d;A7*(a,u)d,.u=0 oO, ul, =9. 
Assume there are positive functions A; such that 
Ai (u)l€? < 50 AP (@, u)&jEe < Ao(u)lél?. 
2. Fix y € C™°(0Q). Consider the operator ®(u) = v, the solution to 
S > 0; A7* (a, u) Ov =0, tile =. 
Show that, for some r > 0, 


6:0) —c’), 
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continuously. Use the Schauder fixed-point theorem to deduce that ® has a fixed point 
in {u € C(Q) : sup |u| < sup |y|}N C(O). 

3. Show that this fixed point lies in C™(Q). 

4. Examine whether solutions to (10.70) are unique. 

5. Extend results on (10.1) to the case 


(10.71) > Fp, (@, Vu) = 0, “ =P; 


arising from the search for critical points of I(u) = f, F q F(x, Vu) dx, generalizing the 
case considered in (10.2). 


In Exercises 6-9, we consider a PDE of the form 
(10.72) S 5 dja’ (x, u, Vu) + B(x, u) =0 on Q. 
We assume a? and b are smooth in their arguments and 
|a’(a,u,p)| < C(u)(p),  |Vpa" (x, u,p)| < C(u). 


We make the ellipticity hypothesis 


Oa! 
De Bp (tr tH DIEIEK = AWE, A(u) > 0. 
Dk 
6. Show that if u € H'(Q) NM L*°(Q) solves (10.72), then u solves a PDE of the form 
S_ 0; A"* (x) pu + Ojo! (w,u) + Blew, u) = 0, 
with 
-eEL™, S > A™(a)& Ex > Alél?. 
(Hint: Start with 
a? (x,u,p) = a) (x,u,0) + )> A” (a, u,p)pr, 


k 
Aa j 
A?¥ (a, u, p) — [ so (0.1, sp) ds.) 


7. Deduce that if u € H1(Q) M L°(Q) solves (10.72), then u is Hdlder continuous on 
the interior of (2. 

8. If 2 is a smooth, bounded region in R” and u € H'(Q) NM L©(Q) satisfies (10.72) and 
tlan = y € C'(9Q), show that u is Hélder continuous on Q and that Vu € M3(Q), 
for some q > n. 

9. Ifue C? (Q) satisfies (10.72), show that we = Ocu satisfies 


dja), (a, u, Vu) Opuc + Oj [a?, (a, u, Vu)ue] 
+0;a3,,(a,u, Vu) + bu(x, u)ue + bz, (x, u) = 0. 


Discuss obtaining estimates on u in C'*" (Q), given estimates on u in C1(Q). 
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11. Direct methods in the calculus of variations 


We study the existence of minima (or other stationary points) of functionals of the 
form 


(11.1) I(u) = | F(a,u, Vu) dV(a), 
{ 


on some set of functions, such as {u € B: u = g on OQ}, where B is a suitable 
Banach space of functions on Q, possibly taking values in R™, and g is a given 
smooth function on OQ. We assume © is a compact Riemannian manifold with 
boundary and 


(11.2) F:RN x (RX @T*Q) — R is continuous. 


Let us begin with a fairly direct generalization of the hypotheses (1.3)—(1.8) 
made in § 1. Thus, let 


(11.3) V ={ue H1(0,R) : u=g on OQ}. 

For now, we assume that, for each x € 2, 

(11.4) F(a,-,-): RN x (RX @ T*Q) — R is convex, 

where the domain has its natural linear structure. We also assume 

(11.5) Ag|é|? — Bo|ul — Co < F(a, u, €), 

for some positive constants Ag, Bo, Co, and 

(11.6) = | F(a, u,€) — F(2z,v,¢)| < C(Ju—v| + |€ — ¢|) (1€l + 1¢| + 1). 
These hypotheses will be relaxed below. 

Proposition 11.1. Assume Q is connected, with nonempty boundary. Assume 
I(u) < co for some u € V. Under the hypotheses (11.2)-(11.6), I has a min- 
imum on V. 

Proof. As in the situation dealt with in Proposition 1.2, we see that [: V + Ris 


Lipschitz continuous, bounded below, and convex. Thus, if a = infy I(u), then 


(11.7) Ke = {we V:ao < I(u) < ag +e} 
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is, for each e € (0,1], a nonempty, closed, convex subset of V. Hence K; is 
weakly compact in H'(Q,R™). Hence ()..) Ke = Ko # 0, and inf I(u) is 
assumed on Ko. 


e>0 
We will state a rather general result whose proof is given by the argument 
above. 


Proposition 11.2. Let V be a closed, convex subset of a reflexive Banach space 
W, and let ® : V + R be a continuous map, satisfying: 


(11.8) inf ® = ap € (—00, 0&0), 
(11.9) 3b > ao such that ®~*([ao, b|) is bounded in W, 
(11.10) Vy € (ao,0], &71([ao, y]) és convex. 


Then there exists v € V such that ®(v) = ao. 

As above, the proof comes down to the observation that, for0 < ¢ < 
b— ag, Kez is a nested family of subsets of W that are compact when W has 
the weak topology. This result encompasses such generalizations of Proposition 
11.1 as the following. Given p € (1,00), g € C~(00, RY), let 
(11.11) V ={ue€ H17(Q,RY): u=g on AQ}. 

We continue to assume (11.4), but replace (11.5) and (11.6) by 
(11.12) Ag|g|? — Bo|u| — Co < F(a, u, €), 


for some positive Ap, Bo, Co, and 


(11.13) |F(a,u,€) — F(#,v,0)] < C(ju—o] + €-C) (el 4 1G) 4)". 


Then we have the following: 


Proposition 11.3. Assume Q is connected, with nonempty boundary. Take p € 
(1,00), and assume I(u) < 00 for some u € V. Under the hypotheses (11.2), 
(11.4), and (11.11)-(11.13), I has a minumum on V. 


It is useful to extend Propositions 11.1 and 11.3, replacing (11.4) by a hypoth- 
esis of convexity only in the last set of variables. 


Proposition 11.4. Make the hypotheses of Proposition 11.1, or more generally of 
Proposition 11.3, but weaken (11.4) to the hypothesis that 


(11.14) F(a, u,-): RY @ TQ — R is convex, 


for each (a,u) € Q x R%. Then I has a minimum on V. 
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Proof. Let ag = infy I(u). The hypothesis (11.12) plus Poincaré’s inequality 
imply that ag > —oo and that 


(11.15) B={ueV:I(u) < ao +1} is bounded in H1?(Q, RY). 
Pick u; € B so that I(u;) — ao. Passing to a subsequence, we can assume 
(11.16) uj; > u weakly in H1?(Q, RY). 

Hence u; — wu strongly in L?(Q, IR“). We want to show that 

(11.17) I(u) = ao. 


To this end, set 


(11.18) P(u,v) = [Few dV (x). 
Q 


With v; = Vuj, we have 
(11.19) ®(u;,v;) > ao. 


Also vj + v = Vu weakly in L?(Q,R @ T*). 
We can conclude that I(u) < ao, and hence (11.17) holds if we show that 


(11.20) P(u,v) < ao. 


Now, by hypothesis (11.13) we have 


|®(uj, 4) — ®(u, v4)| < C / lug — ul (lvg| +1)?" dV (a) 
Q 


(11.21) 
< C"|luj — ullzea); 

SO 

(11.22) B(u, vj) — ao. 


This time, by (11.5), (11.6), and (11.14) we have that, for each € € (0, 1], 
(11.23) K. = {w € L?(0,R% @T*) : B(u,w) < ap +} 
is a closed, convex subset of L?(Q,R @ T*). Hence K- is weakly compact, 


provided it is nonempty. Furthermore, by (11.22), uj € K-, with e; — 0, so we 
have v € Ko. This implies (11.20), so Proposition 11.4 is proved. 
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The following extension of Proposition 11.4 applies to certain constrained 
minimization problems. 


Proposition 11.5. Let p € (1,00), and let F(x,u,€) satisfy the hypotheses of 
Proposition 11.4. Then, if S is any subset of V (given by (11.11)) that is closed in 
the weak topology of H'?(Q,R), it follows that iP has a minimum in S. 


Proof. Let ag = infs I(u), and take u; € S, I(u;) - ag. Since (11.15) holds, 
we can take a subsequence u; + u weakly in H1?(Q, RY), sou € S. We want to 
show that J(u) = ao. Indeed, if we form ®(u, v) as in (11.18), then the argument 
involving (11.19)—(11.23) continues to hold, and our assertion is proved. 


For example, if X C RY is a closed subset, we could take 
(11.24) S={ueV:u(2) €X forae. x € OI, 


and Proposition 11.5 applies. As a specific example, X could be a compact Rie- 
mannian manifold, isometrically imbedded in R%, and we could take p = 2, 
F(a,u, Vu) = |Vul?. The resulting minimum of J(u) is a harmonic map of 
Q into X. If u: Q —+ X is a harmonic map, it satisfies the PDE 


(11.25) Au —T(u)(Vu, Vu) = 0, 


where I'(u)(Vu, Vu) is a certain quadratic form in Vu. See § 2 of Chap. 15 for a 
derivation. 

A generalization of the notion of harmonic map arises in the study of “liquid 
crystals.” One takes 


(11.26) F(x,u, Vu) = a1|Vul? +a2(div u)?+a3(u-curl u)?+a4|u x curl ul?, 
where the coefficients a; are positive constants, and then one minimizes the func- 
tional [, F(#, u, Vu) dV (a) over a set S' of the form (11.24), with X = S? CR’, 
namely, over 

(11.27) S= {ue H'(O,R?®): ju(x)| =1ae.onQ, u=g on dQ}. 


In this case, F(x, u, €) has the form 


F(x, u g)= 2 bale WE bj,.a(u) = a, > 0, 


where each coefficient b;.(u) is a polynomial of degree 2 in u. Clearly, this func- 
tion is convex in . The function F(a, u, €) does not satisfy (11.6); hence, in going 
through the argument establishing Proposition 11.4, we would need to replace the 
p = 2 case of (11.22) by 
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(11.28) |®(u;, vj) — (u, v;)| < c| |uj — ul» |v;|? dV (x). 
2 


The following result covers integrands of the form (11.26), as well as many 
others. It assumes a slightly bigger lower bound on fF’ than the previous results, 
but it greatly relaxes the hypotheses on how rapidly F' can vary. 


Theorem 11.6. Assume Q is connected, with nonempty boundary. Take p € 
(1, 00), and set 


V ={u€ H17(0,RY): u=g on AQ}. 
Assume I(u) < oo for some u € V. Assume that F'(x,u,&) is smooth in 
its arguments and satisfies the convexity condition (11.14) in & and the lower 
bound 
(11.29) Ag|g|? < F(a, u, €), 
for some Ag > 0. Then I has a minimum on V. 
Also, if S is a subset of V that is closed in the weak topology of H'?(Q, RY), 


then I. has a minimum in S. 


Proof. Clearly, a9 = infs I(u) > 0. With B as in (11.15), pick u; € BN S so 
that 


(11.30) I(uj) 3 a0, Uy + u weakly in H1?(Q,RY). 


Passing to a subsequence, we can assume u; — u a.e. on 2. We need to show that 


(11.31) [ Feu) dV <a. 
Q 
By Egorov’s theorem, we can pick measurable sets E, D E,41 D--- inQ, of 


measure < 2~”, such that u; — u uniformly on 2 \ E,,. We can also arrange that 
(11.32) ju(a)| + |Vu(a)] <C-2”, for 7 €O\ E,. 


Now, we have 
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j F(ax,u, Vu) dV = ; F(a2,u;,Vu;) dV 
OQ\ Ev OQ\ EL 


(11.33) 1 / [F(z, uj, Vu) — F(x, uj, Vu,)] dV 
O\EL 


+ / [F(2,u, Vu) — F(x, uj, Vu)] av. 


OVE, 


To estimate the second integral on the right side of (11.33), we use the convexity 
hypothesis to write 


(11.34) F(a,u;, Vu) — F(2,u;, Vuj;) < DeF (a, u;, Vu) - (Vu — Vu;). 
Now, for each rv, 
(11.35) D-F («,u;,Vu) — DeF(ax,u, Vu), uniformly on 2 \ EL, 


while Vu — Vu; — 0 weakly in L?(Q,R”), so 


(11.36) lim [F (2, uj, Vu) — F(a, uj, Vu;)] dV = 0. 
W eed®,°) 


O\EL 
Estimating the last integral in (11.33) is easy, since 
(11.37) F(ax,u,Vu) — F(a,u;,Vu) —>0, uniformly on 2 \ E,. 


Thus, from our analysis of (11.33), we have 


(11.38) / F(a,u, Vu) dV < limsup i F(ax,u,;,Vu;) dV < a9, 
j-co 


Q\ Ey Q\E, 
for all v, and taking v — oo gives (11.31). The theorem is proved. 
There are a number of variants of the results above. We mention one: 
Proposition 11.7. Assume that F is smooth in (x, u, €), that 
(11.39) Pieuye) > 0, 
and that 


(11.40) F(a, u,-) : RY @ TQ — R is convex, 
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for each x, u. Suppose 


(11.41) uy > uweakly in H(Q,R). 
Then 
(11.42) I(u) < liminf I(u,). 


For a proof, and other extensions, see [Gia] or [Dac]. It is a result of J. Serrin 
[Sel] that, in the case where wu is real-valued, the hypothesis (11.41) can be 
weakened to 


(11.43) Uwe He (OO), ty—sw im LEO): 


In [Mor2] there is an attempt to extend Serrin’s result to systems, but it was shown 
by [Eis] that such an extension is false. 

In [Dac] there is also a discussion of a replacement for convexity, due to 
Morrey, called “quasi-convexity.” For other contexts in which the convexity 
hypothesis is absent, and one often looks not for a minimizer but some sort of 
saddle point, see [Str2] and [Gia2]. 

In this section we have obtained solutions to extremal problems, but these 
solutions lie in Sobolev spaces with rather low regularity. The problem of higher 
regularity for such solutions is considered in § 12. 


Exercises 
1. In Theorem 11.6, take p > n = dim 2 = N, and consider 
S={ueV: det Du=1, ae.onO}. 


Show that S' is closed in the weak topology of H'?(Q,R”) and hence that Theorem 
11.6 applies. (Hint: See (6.35)-(6.36) of Chap. 13.) 
2. In Theorem 11.6, take p € (1,00), QC R”, N =1.Leth € C™(Q), and consider 


S={weV:u>honQ}. 
Show that S is closed in the weak topology of H'?(Q) and hence that Theorem 11.6 
applies. 
Say I | g achieves its minimum at uw, and suppose you are given that u € C(Q), so 


O={x# ED: u(x) > h(x)} 


is open. Assume also that OF /O€; and OF'/Ou satisfy convenient bounds. Show that, 
on O, u satisfies the PDE 


>, Oj Fe, (a, u, Vu) + Fu(a,u, Vu) = 0. 


g 
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For more on this sort of variational problem, see [KS]. 


12. Quasi-linear elliptic systems 


Here we (partially) extend the study of the scalar equation (10.1) to a study of an 
N x N system 


(12.1) AI, (Vu)dj0.u® =0 on Q, uw=¢ on AQ, 


where ~p € C™®(00,RY) is given. The hypothesis of strong ellipticity used 
previously is 


(12.2) S- AMG (p)vavpejee > Clui*|é?, C>0, 


but many nonlinear results require that AS (p) satisfy the very strong ellipticity 
hypothesis: 


(12.3) So AMG (P) Gabe > KIC?, «> 0. 


We mention that, in much of the literature, (12.3) is called strong ellipticity and 
(12.2) is called the “Legendre-—Hadamard condition.” 
In the case when (12.1) arises from minimizing the function 


(12.4) I(u) = [ro dx, 
Q 

we have 

(12.5) AMG (P) = Opie Orns ¥ (P)- 


In such a case, (12.3) is the statement that F'(p) is a uniformly strongly convex 
function of p. If (12.5) holds, (12.1) can be written as 


(12.6) S°d,Gi(Vu) =0 on Q, u=y on AQ; G4(p) = dp,,F(p). 
J 


We will assume 


ag|p|? — bo < F(p) < ai|p|* + b1, 


(12.7) . 
IG2(p)| < Co(p), — [Ada (p)| S C1. 


These are called “controllable growth conditions.” 
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If (12.5) holds, then 


2jG4 (Vu) — 0jGL (Tv) = 0.45, (2) Ox u! — v°), 
(12.8) 1 - 
Alig (&) = | Aljg(sVu+t (1 — s)Vv) ds. 


This leads to a uniqueness result: 


Proposition 12.1. Assume Q C R” is a smoothly bounded domain, and assume 
that (12.3) and (12.7) hold. If u,v € H*(Q,R%) both solve (12.6), then u = v 
on). 


Proof. By (12.8), we have 


(12.9) fae (x) 0;(u% — v%) O,(u% — v®) dx = 0, 


so (12.3) implies 0;(u — v) = 0, which immediately gives u = v. 


Let X = > b“0¢ be a smooth vector field on , tangent to 02. If we knew that 
u € H?(Q), we could deduce that ux = Xuis the unique solution in H1(Q, RY) 
to 


(12.10) S 0; A7*(Vu) Onux =S ofits, ux = Xy on OO, 
where 


f? = AJ*(Vu)(Onb) (Oeu) + (cb) G4 (Vu), 


12.11 
: —(0,0;b°)G3 (Vu). 


g 


Under the growth hypothesis (12.7), |f?(«)| < C|Vu(a)|, so ||f*\|z2(a) < 
C||Vull 2a). Similarly, ||g||L2(a) < C||Vul|z2(q) + C. Hence, we can say that 
(12.10) has a unique solution, satisfying 


(12.12) lux l(a) < C(llull2(ay + [l¥llz2qay + 1). 


It is unsatisfactory to hypothesize that u belong to H?(Q), so we replace the 
differentiation of (12.6) by taking difference quotients. Let F‘ denote the flow 
on Q generated by X, and set uz, = uo F um Then up, extremizes a functional 


(12.13) Ip,(un) = | P@,Yun) dx, 
Q 


where F},(a,p) depends smoothly on (h,x,p) and Fo(x,p) = F(p). (In fact, 
(12.13) is simply (12.4), after a coordinate change.) Thus wy, satisfies the PDE 
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(12.14) Oj (Opin Fa) (2, Vun) = 0, Un = Pr ON on. 


Applying the fundamental theorem of calculus to the difference of (12.14) and 
(12.6), we have 


— uP 


uP ; 
(12.15) O;AN,, (2) Oe (“2 =) = 0;H2,(2, Vun), 


h 


where Ae x) is as in (12.8), with v = up, and 


(12.16) He) -[3 oF Op; H's) (2, p) ds. 

As in the analysis of (12.10), we have 

(12.17) |h~* (un — u) |x) < C(llullz2(@y + [¥llz2@) +1). 
Taking h > 0, we have ux € H1(Q, RY), with the estimate (12.12). 

From here, a standard use of ellipticity, parallel to the argument in 
(10.21)-(10.25), gives an H'-bound on a transversal derivative of u; hence 
u € H?(Q,R"), and 
(12.18) lull z72¢0) < C(llullencay + Il¢llz2@) + 1). 

As in the scalar case, one of the keys to the further analysis of a solution to 


(12.6) is an examination of regularity for solutions to linear elliptic systems with 
L°°-coefficients. Thus we consider linear operators of the form 


(12.19) Lu = W(x >> 0; (AI*(a)b(x) Ox), 


jk=l1 
Compare with (9.1). Here wu takes values in R% and each A?” is an N x N matrix, 


with real-valued entries A2* wa © L(Q). We assume AN = AR. As in (12.3), 
we make the hypothesis 


(12.20) MIC? = $5 AG (x)GjaCna = AolG/?, Av > 0, 


of very strong ellipticity. Thus Ane *. defines a positive-definite inner product ( , ) 
on T* @ RX. We also assume 


(12.21) 0< Cy < W(x) < Ch. 


Then b(a) dx = dV defines a volume element, and, for y € C4(Q, RY), 
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(12.22) (Lu, ¢) =- fi Vu, Vy) d 
Q 


We will establish the following result of [Mey]. 


Proposition 12.2. Let Q C R” be a bounded domain with smooth boundary, let 
f; € L1(Q,RY) for some q > 2, and let u be the unique solution in Hy? (Q) to 


(12.23) In=>~ 0; fj. 


Assume L has the form (12.19), with coefficients AJF € L*(Q), satisfying 
(12.20), and b € C®%(Q), satisfying (12.21). Then u € H'?(Q), for some p > 2. 


Proof. We define the affine map 

(12.24) Tei," (O)— FO) 

as follows. Let A be the Laplace operator on 9, endowed with a smooth 
Riemannian metric whose volume element is dV = b(x) dx, and adjust Xo, A1 


so (12.20) holds when |¢|? is computed via the inner product (, ) on T* @ RY 
associated with this metric, so that 


(12.25) (Au, vy) = - [ (74,99) dv. 
Q 


Then we define Tw = v to be the unique solution in Hj’*(Q) to 
(12.26) Av = Aw — Aj 'Lwt Az! S005 fj. 


The mapping property (12.24) holds for 2 < p < q, by the L?-estimates of 
Chap. 13. In fact, if Av = > 0;9;, v € Hy’7(Q), then 
(12.27) lVullzecay < C(p)Ilgll zea) 


If we fix r > 2, then, for 2 < p <r, interpolation yields such an estimate, with 


(12.28) C(p)=C(r)®, ———+-=-, ie, @= 
Tr 
Hence C(p) \, 1, as p \, 2. Now we see that Tw — Tw = v1 — v2 satisfies 


(12.29) A(v1 — v2) = (A — Az1L) (wi — we) = Vo, 


where 
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(12.30) 99 = ;(we — wS) — Apt As, On (wl — wh), 


and hence, under our hypotheses, 


r 
(12.31) lallzncey < (1— $2) IV (wr — w2)ll crop, 
1 
SO 
Xo 
(12.32) [IV(v1 ~ v2) lancay $ C)(1— FZ) IV Cwn — w2)Ihuoca, 


for 2 < p < q. We see that, for some p > 2, C(p)(1 — Ao/A1) < 1; hence T 
is a contraction on H!-?(Q) in such a case. Thus T has a unique fixed point. This 
fixed point is u, so we have u € Hy’?(Q), as claimed. 


Corollary 12.3. With hypotheses as in Proposition 12.2, given a function y © 
H14(Q), the unique solution u € H':?(Q) satisfying (12.23) and 
(12.33) u=w on OQ 


also belongs to H':?(Q), for some p > 2. 


Proof. Apply Proposition 12.2 to u— w. 


Let us return to the analysis of a solution u € H1(Q,R) to the nonlinear 
system (12.6), under the hypotheses of Proposition 12.1. Since we have estab- 
lished that u € H?(Q,R), we have a bound 


(12.34) |Vullraa) <A, gq >2. 


In fact, this holds with ¢ = 2n/(n — 2) if n > 3, and for all g < co ifn = 2. 
As above, if X = >> bf Op is a smooth vector field on 2, tangent to 0, then 
ux = Xuwis the unique solution in H!(Q,R%) to (12.10), and we can now say 
that f? € L4(Q). Thus Corollary 12.3 gives 


(12.35) Xue H'?(Q), for some p > 2, 
with a bound, and again a standard use of ellipticity gives an H'?-bound on a 
transversal derivative of u. We have established the following result. 


Theorem 12.4. [fu € H'(Q,R*) solves (12.6) on a smoothly bounded domain 
Q € R”, and if the very strong ellipticity hypothesis (12.3) and the controllable 
growth hypothesis (12.7) hold, then u € H??(Q, RY), for some p > 2, and 


(12.36) I!ulla2.2ca) < C(||Vullr2(@) + ll¢llz2<cay + 1). 


The case n = dim (2 = 2 of this result is particularly significant, since, for 
p >n, H+?(Q) c C"(Q), r > 0. Thus, under the hypotheses of Theorem 12.4, 
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we have u € C!+"(Q), for some r > 0, if n = 2. Then the material of § 8 applies 
to (12.1), so we have the following: 


Proposition 12.5. Jf u € H'(Q,R%) solves (12.6) on a smoothly bounded 
domain Q C R?, and the hypotheses (12.3) and (12.7) hold, then u € C®(Q), 
provided py € C® (OQ). 


When n = 2, we then have existence of a unique smooth solution to (12.1), 
given y € C™(OQ). In fact, we have two routes to such existence. We could 
obtain a minimizer u € H1(Q,R%) for (12.4), subject to the condition that 
ul aq = & by the results of § 11, and then apply Proposition 12.5 to deduce 
smoothness. 

Alternatively, we could apply the continuity method, to solve 


(12.37) Al", (Vu)d;0,u® =0 on Q, w=Ty on AQ. 


This is clearly solvable for 7 = 0, and the proof that the biggest 7-interval 
J c [0,1], containing 0, on which (12.37) has a unique solution u € C%°(Q), 
is both open and closed is accomplished along lines similar to arguments in § 10. 
However, unlike in § 10, we do not need to establish a sup-norm bound on Vu, 
or even on u; we make do with an H!-norm bound, which can be deduced from 
(12.3) as follows. 

If Ae) is given by (12.8), with v = y, we have 


[AE Oulu? — 99) 0j(u8 — 98) ae 
(12.38) - 
= faci cve\(ut — 9%) ae, 
Q 
for a solution to (12.37) (in case T = 1). Hence 
(12.39) K||V(u— v)\IZ2@) < Cllu— ¢llzz(ay- 


Note the different exponents. We have ||u — 9llz2(@) < C4||V(u - ¥)\IZ2(a)> by 
Poincaré’s inequality, so 


(12.40) lu ella $ 
Plugging this back into (12.39) gives 

C2 
(12.41) |V(u— v)llZ2~a) < 20y 


which implies the desired H'!-bound on uw. 
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Once we have the H!-bound on u = u-, (12.36) gives an H?-?-bound for 
some p > 2, hence a bound in C'*"(Q), for some r > 0. Then the results of § 8 
give bounds in higher norms, sufficient to show that J is closed. 

Proposition 12.5 does not in itself imply all the results of § 10 when dim 2 = 2, 
since the hypotheses (12.3) and (12.7) imply that (12.1) is uniformly elliptic. For 
example, the minimal surface equation is not covered by Proposition 12.5. How- 
ever, it is a simple matter to prove the following result, which does (essentially) 
contain the n = 2 case of Theorem 10.2. 


Proposition 12.6. Assume A (p) is smooth in p and satisfies 


(12.42) ANs(P)GjaCks = C(p)IC|?, C(p) > 0. 


Let Q C R? be a smoothly bounded domain. Then the Dirichlet problem (12.1) 
has a unique solution u € C®(Q), provided one has an a priori bound 


(12.43) |Vur||z~(a) <K, 
for all smooth solutions u = u, to (12.37), for 7 € (0, 1]. 


Proof. Use the method of continuity, as above. To prove that J is closed, simply 
modify F'\(p) on {p : |p| > K +1} to obtain F'(p), satisfying (12.3) and (12.7). 
The solution u, to (12.1) for 7 € J also solves the modified equation, for which 
(12.36) works, so as above we have strong norm bounds on wu, as T approaches 
an endpoint of J. 


Recall that, for scalar equations, (12.43) follows from a boundary gradient esti- 
mate, via the maximum principle. The maximum principle is not available for 
general elliptic V x N systems, even under the very strong ellipticity hypothesis, 
so (12.43) is then a more severe hypothesis. 

Moving beyond the case n = 2, we need to confront the fact that solutions 
to elliptic PDE of the form (12.1) need not be smooth everywhere. A number 
of examples have been found; we give one of J. Necas [Nec], where AN, (p) in 
(12.1) has the form (12.5), satisfying (12.3), such that F'(p) satisfies |D° F'(p)| < 
Ca(p)—'!|p|?, Va > 0. Namely, take 


1 dus Out) as pw Oud Oukk 
2 Ox, Ox, 2 Ox, Ox; 
Ou du%* du” dui* 


-2 
PA ae OX, Oxye OX wer 


F(Vu) = 


(12.44) 


. 2 
where wu takes values in M,,,.,, ~ IR” , and we set 


n® —1 4+na 
(12.45) em CORPS LET: yaa)? b= 
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Since A, ps + 0 as n — oo, we have ellipticity for sufficiently large n. But for any 
n, 
UX 5 


(12.46) ud (x2) = i 


is a solution to (12.1). Thus wu is Lipschitz but not Ct on every neighborhood 
of 0 € R”. See [Gia] for other examples. Also, when one looks at more general 
classes of nonlinear elliptic systems, there are examples of singular solutions even 
in the case n = 2; this is discussed further in § 12B. 

We now discuss some results known as partial regularity, to the effect that 
solutions u € H!(Q, RY) to (12.1) can be singular only on relatively small sub- 
sets of 2. 

We will measure how small the singular set is via the Hausdorff s-dimensional 
measure H.*, which is defined for s € [0, co) as follows. First, given p > 0, SC 
R”, set 


(12.47) hE ,(S) =inf S “(diam Y;)°: Sc LJ ¥;, diam ¥; < e}. 


j21 j21 


Here diam Y; = sup{|x — y| : 2,y € Yj}. Each set function h , is an outer 
measure on R”. As p decreases, h ,(.S) increases. Set 


(12.48) h=(S) = lim hz plo). 
pO 


Then h*(S') is an outer measure. It is seen to be a metric outer measure, that is, 
if A,B C R” and inf{|z — y| : « € Avy € B} > O, then h§(A UB) = 
h=(A) + hz(B). It follows by a fundamental theorem of Caratheodory that every 
Borel set in R” is ht-measurable. For any h-measurable set A, we set 


12.49 41°(A) = ygh®(A mei?a-* 
(12.49) (A) = 7sh5(A), = THI)’ 


the factor y, being picked so that if k < n is an integer and S C R” is a smooth, 
k-dimensional surface, then (S) is exactly the k-dimensional surface area of S. 
Treatments of Hausdorff measure can be found in [EG, Fed, Fol]. 

Our next goal will be to establish the following result. Assume n > 3. 


Theorem 12.7. Jf Q C R” is a smoothly bounded domain and u € H'(Q, RY) 
solves (12.1), then there exists an open Qo C Q such that u € C%(Qo) and 


(12.50) H"(Q\ OQ) =0, for some r<n-2. 


We know from Theorem 12.4 that u € H??(Q,R%), for some p > 2. Hence 
(12.10) holds for derivatives of w; in particular, 
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(12.51) ue = Ogu => ue € H*?(0,RY) 

and 

(12.52) 0; AI*(Vu)Opue =0, 1< <n. 

Regarding this as an elliptic system for v = (O,u,...,0,u), we see that to 


establish Theorem 12.7, it suffices to prove the following: 


Proposition 12.8. Assume that v € H'?(Q,R™), for some p > 2, and that v 
solves the system 


(12.53) 0; AP* (ax, v) Oxv = 0, 
where AL, v) is uniformly continuous in (a, v) and satisfies 
(12.54) MIC)? = A254 (x, v) jake = AvICl?, Ao > 0. 


Then there is an open Qo C Q such that v is Hélder continuous on Qo, and (12.50) 
holds. 


In turn, we will derive Proposition 12.8 from the following more precise result: 


Proposition 12.9. Under the hypotheses of Proposition 12.8, consider the subset 
XC Q defined by 


(12.55) rei liminf R™ / |v(y) — ve,r|? dy > 0, 
Br(2) 

where 
(12.56) A : : (ya 

; Ue R= = v : 

,R VE Br(z) Uv Vol Br(x) Yy) ay 

Then 
(12.57) H' (=) =0, for some r<n-—2, 


and & contains a closed subset > of Q such that v is Holder continuous on Qo = 
Q\ >. 


Note that every point of continuity of v belongs to Q \ &; it follows from 
Proposition 12.9 that v is Hélder continuous on a neighborhood of every point of 
continuity, under the hypotheses of Proposition 12.8. As Lemma 12.11 will show, 
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for this fact we need assume only that u € H!?, instead of u € H'? for some 
p>2. 


Let us first prove that &, defined by (12.55), has the property (12.57). First, by 
Poincaré’s inequality, 


(12.58) we {« €Q:liminf R" / |Vo(y)|2 dy > 0}. 
Br(«) 

Since Vu € L?(Q) for some p > 2, Hélder’s inequality implies 

(12.59) ie {x €Q:liminf RP" / |Vu(y)|? dy > 0}. 
Br(a) 

Therefore, (12.57) is a consequence of the following. 


Lemma 12.10. Given w € L1(Q), 0< 5 <n, let 


(12.60) E, = {a €Q:limsup r7° / |w(y)| dy > 0}. 
r—0 
B, (ax) 
Then 
(12.61) Ho" (E,)=0, Ve>d. 


It is actually true that H*(E,) = 0 (see [EG] and [Gia]), but to shorten the 
argument we will merely prove the weaker result (12.61), which will suffice for 
our purposes. In fact, we will show that 


(12.62) H (Ess) < 00, WO>0, 
where 
Eys5 = {« €Q:limsup r-* / |w(y)| dy > 5}. 
r—0 
B,(@) 


This implies that H*t*(E.5) = 0, Ve > 0, and since E, = U,, Es,1/n, this 
yields (12.61). 
As a tool in the argument, we use the following: 


Vitali covering lemma. Let C be a collection of closed balls in R” (with positive 
radius) such that diam B < Co < ov, forall B € C. Then there exists a countable 
family F of disjoint balls in C such that 
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(12.63) Li BS ChB, 


BEF Bec 


where B is a ball concentric with B, with five times its radius. 
Sketch of proof. Take C; = {B €C:2-ICo < diam B < 2! JC}. Let Fy 


be a maximal disjoint collection of balls in C1. Inductively, let 7; be a maximal 
disjoint set of balls in 


{B €C, : B disjoint from all balls in F1,...,F,-1}. 


Then set F = (J F;. One can then verify (12.63). 


To begin the proof of (12.62), note that, for each p > 0, E,5 is covered by a 
collection C of balls B, of radius rz, < p, such that 


(12.64) [helo ay > ors. 
Bz 


Thus there is a collection F of disjoint balls B, in C (of radius r,,) such that 
(12.63) holds. In particular, {B,} covers E,5, so 


* Ss Cn Cr 
(12.65) Wp Ens) < On DI rh SF / jw(y)| dy < Hella); 


UB, 
where C,, is independent of p. This proves (12.62) and hence Lemma 12.10. 


Thus we have (12.57) in Proposition 12.9. To prove the other results stated in 
that proposition, we will establish the following: 


Lemma 12.11. Given 7 € (0,1), there exist constants 
€9 = €0(7,n,M,A9*A1), Ro = Ro(t,n,M,AQ*A1), 
and furthermore there exists a constant 
Ap = Ao(n, M, 9 *A1), 


independent of T, such that the following holds. If u € H!(Q, R™) solves (12.53) 
and if, for some xo € Q and some 


R < Ro(xo) = min( Ro, dist(xo, 0Q)), 
we have 


(12.66) (aay) < 65, 
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where 

(12.67) U(0o,R)= Rf July) = o.al? ay 
Br(2o) 

then 

(12.68) U(ao, TR) < 2Agr?U (x0, R). 


Let us show how this result yields Proposition 12.9. Pick a € (0,1), and 
choose 7 € (0,1) such that 2A97?~?* = 1. Suppose xp € Q and R < min(Ro, 
dist(ao, 0Q)), and suppose (12.66) holds. Then (12.68) implies 

U(2o,TR) < 77°U (20, R). 


In particular, U(2p, TR) < U(xo, R) < €2, so inductively the implication (12.66) 
=> (12.68) yields 


U(ao,7°R) < 72 U (xp, R). 
Hence, for p < R, 
p 2a 
(12.69) U(eo,p) <C(F) U(eo,R). 
Note that, for fixed R > 0, U(ao, R) is continuous in xo, so if (12.66) holds 


at xo, then we have U(x, R) < €2 for every x in some neighborhood B,.(ao) of 
Xo, and hence 


2a 
U(x, p) < o( 4) U(a,R), «x € B,(xo); 
that is, we have 


(12.70) |u(y) — Ux,pl? dy < Cp™t?> 


Bp (x) 
uniformly for « € B,(xo). This implies, by Proposition A.2, 
(12.71) u € C*(B,(20)). 


In fact, we can say more. Extending some of the preliminary results of § 9, we 
have, for a solution u € H1+(Q) of (12.53), estimates of the form 


(12.72) IVullzace,.(@)) < Ce? |u(y) — te,pl? dy; 


By(x) 
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see Exercise 2 below. Consequently, (12.70) implies 


(12.73) Vel sca) © ME (Br(20)), 9 = = 


which by Morrey’s lemma implies (12.71). Thus, granted Lemma 12.11, 
Proposition 12.9 is proved, with 


(12.74) Q={aoEQ: inf  U(ao,R) < e}, 
R<Ro(xo) 


since clearly © > Q\ Q =. 

The proof of Lemma 12.11 (following the exposition in [Gia]) evolved from 
work of E. DeGiorgi [DeG2] and F. Almgren [Alm2] on regularity for minimal 
surfaces. It consists of blowing up small neighborhoods of x9 and obtaining a 
limiting PDE for a limit of the resulting dilations of u. As a preliminary to the 
proof of Lemma 12.11, we first identify the constant Ag. 


Lemma 12.12. There is a constant Ag = Ao(n,M, A1/Ao) such that whenever 
ve are constants satisfying 


(12.75) MIC? = S> we GaCes > Aol¢I?, Av > 0, 
the following holds. If u € H'(B,(0),R™) solves 


(12.76) O;b20,u =0 on B,(0), 

then, for all p € (0,1), 

(12.77) U(0, p) < Aop”?U(0, 1). 

Proof. For p € (0, 1/2], we have 

12.78) UO,e) <p f \Vuly)P dy < CrP? IVul cB, 00 


B, (0) 


On the other hand, regularity for the constant-coefficient, elliptic PDE (12.76) 
readily yields an estimate 


(12.79) ||Vull7<(B,,.(0)) < Boll VullZ2¢e,,4(0)) S Billu — worllz2ce, (0): 


with B; = B;(n, M, A1/o), from which (12.77) easily follows. 


We now tackle the proof of Lemma 12.11. If the conclusion (12.68) is false, 
then there exist T € (0,1) andz, €, e, 30, R, 3 0,andu, € H1(0,R™), 
solving (12.53), such that 
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(12.80) Ue, tol Set, Uieus Rs) > 2hgr eS. 
To implement the dilation argument mentioned above, we set 
(12.81) v(x) =e, (u(x, + Ryx) — uv2,,r, |: 
Then v,, solves 


(12.82) 9; ANS (a, + Rva, evry (x) + e,,p,) Ove =O on By (0). 


If we set 
Vi(0,)= 0" ff luv(y) ~ wool? dy 

B,(0 

(12.83) mn 
= 6529" fh uly) ~ te. P dy, 
Bory (av) 

we have (since v,9,1 = 0) 
(12.84) V.(0, 1) = llevlZ2(8,@) =1, VW(0,r) > 2Aor?. 


Passing to a subsequence, we can assume that 
(12.85) Uy > v weakly in L?(B,(0),R”), e,v, > Oae. in By (0). 
Also 


(12.86) Als (By; Uve,,R,) —> bags 


an array of constants satisfying (12.75). The uniform continuity of A then 
implies 


(12.87) AMG (a, + Rx, eyvy(2) + Uve,,r,) —> 25 ae.in By(0). 
Now, as in (12.72), the fact that v, solves (12.82) implies 

(12.88) lw llea1(B,0)) < Cp, Ve<t. 

Hence, passing to a further subsequence if necessary, we have 


Uy —> v strongly in Li,,(B1(0)) 


(12.89) os 
Vu, — Vv weakly in Ly, (Bi(0)). 
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Since the functions in (12.87) are uniformly bounded on B,(0), these results 
imply that we can pass to the limit in (12.82), to conclude that 


(12.90) d;b)%, xv? =0 on By (0). 

Then Lemma 12.12 implies 

(12.91) V(0,7) < Apr?V(0, 1), 

which is < Agr? by (12.85). On the other hand, (12.89) implies 
(12.92) V (0,7) > 2Aor? 


if (12.80) holds. This contradiction proves Lemma 12.11. 

Hence the proof of Proposition 12.9 is complete, so we have Theorem 12.7. 

Theorem 12.7 can be extended to a result on partial regularity up to the bound- 
ary (see [Gia]). 

There is a condition more general than strong convexity on the integrand in 
(12.4), known as “quasi-convexity,’ under which extrema for (12.4) have been 
shown to possess partial regularity of the sort established in Theorem 12.7 (see 
[Ev3]). 

There are also some results on regularity everywhere for stationary points of 
(12.4) when 2 has dimension > 3. A notable result of [U] is that such solutions 
are smooth on 2 provided F'(Vu) in (12.4), in addition to being strongly convex 
in Vu and satisfying the controllable growth conditions, depends only on |Vu|?. 
A proof can also be found in [Gia]. 


Exercises 


In Exercises 1-3, we consider an N x N system 
(12.93) S- 0; A246 (x)Oxu” = S$" O;f# on Bi = {x ER” : |2| < 1}, 


under the very strong ellipticity hypothesis (12.20). Assume f; € LP (Bi). 
1. Show that, with C = C(Ao, A1), 


(12.94) |Vull 22 (54,2) < Cllulln2¢a,) + cye fs llz2¢8,)- 


(Hint: Extend (9.6).) 
2. Let 6-u(x) = u(r). Show that, for r € (0, 1], 


(12.95) I6r(Wu)z2¢m, 4) S Cr lor — Wllz2ay) +O D> Mor fillaca,y> 


where U = Avg, wu. (Hint: First apply a dilation argument to (12.94). Then apply the 
result to wu — UW.) This sort of estimate is called a ““Caccioppoli inequality.” 
3. Deduce from Exercise 2 that if u € H'(Q) solves (12.93), then 
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(12.96) 
Il5r(Ve) |22(81 2) S Cllor(Vu)||na¢a1) + cy. lor filln2ayy, @= 


n+2 


This sort of estimate is sometimes called a “reverse HGlder inequality.” 
4. Deduce from (12.95) that if u € H*(Q) solves (12.93), then, forO0 <r < 1, 


n 


(12.97) UWE C"(B1), fi € M3 (B1), p= 1 Vu M3 (Bi/2). 


-r 
Compare (9.41)—(9.42). 

5. Let C(p) be the constant in (12.27), in case 2 = B1. Show that if C(n)(1—Ao/A1) < 
1, then a solution u € Hg(Q) to (12.93) is Hélder continuous on By, provided f; € 
L1(B,) for some q > n. Consider the problem of obtaining precise estimates on C'(p) 
in this case. 
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Regularity questions can become more complex when lower-order terms are 
added to systems of the form (12.1). In fact, there are extra complications even 
for solutions to a semilinear system of the form 

(12b.1) Lu+ B(a,u, Vu) = f, 

where L is a second-order, linear elliptic differential operator and B(x, wu, /p) is 
smooth in its arguments. One limitation on what one could possibly prove is given 
by the following example of J. Frehse [Freh], namely that 

(12b.2) ui(a) = sinloglog|z|~', —u2(a) = cos log log |a|~! 


provides a bounded, weak solution to the 2 x 2 system 


2 
Ry 82) Gahan 
1+ [ul 
(12b.3) i 
Aug + 32 ul? = 0, 
1+ ful? 


belonging to H'(B), for any ball B C R?, centered at the origin, of radius r < 
1. Evidently, wu is not continuous at the origin; one can also see that Vu does 
not belong to L?(B) for any p > 2. (After all, that would force u to be Hélder 
continuous.) Thus Theorem 12.4 and Proposition 12.5 do not extend to this case. 

The following result shows that if a weak solution to such a semilinear system 
as (12b.1) has any Hélder continuity, then higher-order regularity results hold. 


Proposition 12B.1. Assume u € H? solves (12b.1) and B(x,u,p) is a smooth 
function of its arguments, satisfying 
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(12b.4) |B(x, u,p)| < C(p)?. 

Then, givenr > 0, s > —1, 

(12b.5) uec", fece—uec™. 
Proof. Write 

(12b.6) u= Ef —EB(a,u,Vu), modC™, 


where F € OPS). a is a parametrix for the elliptic operator L. We have Ef € 
C’*?, and, since u € H! > B(x, u, Vu) € L', we have 


EB(a,u,Vu) € H2-7 +2, Ye>0, o> a 
If s > 0, this implies 
(12b.7) wen CFT nee, 
for all p < oo, hence 
(12b.8) u € (H?-? + AT-%?)|4, VG (0,1). 


Results on such interpolation spaces follow from (6.30) of Chap. 13. If we set 
6 = 1/2 and take p large enough, we have 


(12b.9) ue Hit/2-2242 Yee (0,1), o> 
l+e 


On the other hand, if we set 90 = (1 — a) /(2 — r), (assuming r < 1), we have 


1—4r 
(12b.10) uc H4, Va< ; =; 
—T 

hence 

i 1— $r r 
(12b.11) B(az,u,Vu) € L4, Va< == eg.g=1+ 5. 
Another look at (12b.6) now yields 
(12b.12) u € H* 47/2 4 HT? Ip < 00, 


provided s > 0, which is an improvement of (12b.7). We can iterate this argument 
until we get (12b.5), provided s > 0. 
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If instead we merely assume s > —1, then, instead of (12b.7), we deduce from 
(12b.6) and E B(x, u, Vu) € H?-°!+* that 


(12b.13) EB(a,u, Vu) E€ H2-olte A He oP 
and hence (parallel to (12b.8)—(12b.11)) that 


EB(x,u,Vu)€ () (H?-7t*, HT? ly 
(12b.14) 9€(0,1) 
Cc Ht?r/2-0,2 a Aletr 


so another look at (12b.6) gives 


ue Hit, 
hence 
(12b.15) B(x, u, Vu) € LAt*/?, 
so 
(12b.16) EB(a,u, Vu) € H21+"/2 9 yee, 


and we can iterate this argument until (12b.5) is proved. 


Note that Proposition 12B.1 applies to the semilinear system (11.25) for a 
harmonic map u : 2 —> X, where X is a submanifold of RY: 


(12b.17) Au —T(u)(Vu, Vu) = 0. 


On the other hand, there are quasi-linear equations with a somewhat similar struc- 
ture that also arise naturally in geometry, such as the system (4.94) satisfied by 
the metric tensor, in harmonic coordinates, when the Ricci tensor is given. This 
system has the following form, more general than (12b.1): 


(12b.18) S¢ dja7* (x, u)Opu + B(x, u, Vu) = f. 


We assume that a/*(x, u) and B(a,u, p) are smooth in their arguments and that 
(12b.4) holds. Recall that we have established one regularity result for such a 
system in § 4, namely, if n = dim 9 and n < gq < p < ow, then 


(12b.19) ué€ H'4, fe H®? —>ue Hs? 


if s > —1. Here, we want to weaken the hypothesis that u € H 14 for some g > n, 
which of course implies u € C”, r = 1 — n/q. We will establish the following: 
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Proposition 12B.2. Assume that u € Ht? solves (12b.18) and that B(x,u,p) 
satisfies (12b.4). Also assume u € C” for some r > 0. Then 


NE 


(12b.20) fe sue He, Vee (0,1), o> Tre? 


and, ifl1<p<@, 
(12b.21) fel? —ue H’?, 
More generally, for s > 0, 
(12b.22) fe H®? ue He??, 
To begin the proof, as in the demonstration of Proposition 4.9, we write 
(12b.23) S- a™ (x, u) O,u = A;(uyn,.D)u, 
mod C'™, with 
(12b.24) ue C" => Aj(uyz,8) € CST) N Si, + Sty". 
Hence, given 6 € (0,1), 


Aj(u;2,€) = A¥ (a, €) + Ab(z, 6), 


(12b.25) 
A¥(x,€)€ Sts, Ab(w,€) € S17”. 


Thus we can write 


(12b.26) S > dja7*(a, £) Ou = P#ut PPu, 
with 

(12b.27) P# = 5° 0;A*(«,D) € OPS{.5, elliptic 
and 

(12b.28) P= > OAD). 

Then we let 

(12b.29) E* € OPS;3 


be a parametrix for P, and we have 
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(12b.30) u = —E* P’u+ E* B(x, u, Vu) + E* f, 
mod C®, and if u € C’, 
(12b.31) Pere? 3H See, Peo, 
provided 1 < p < co ando —2+ 76 > —1,s0 
(12b.32) o>1-r6. 
Therefore, our hypotheses on wu imply 
(12b.33) E* Pky € Ht? , 
Now, if u € H'(Q), then (12b.4) implies 
(12b.34) B(x, u, Vu) € L', 
so, for small e > 0, o > ne/(1 +6), 
(12b.35) E* B(x,u, Vu) € H2-ot€, 

Hence we have (12b.30), mod C®, with 


E# Poy © W152, E* B(z,u, Vu) € H2-ol€, 


(12b.36) pivewieate 


This implies 
we Hitrolte 


hence, by (12b.31), 
(12b.37) Bt Poy c Hit2rslte, 
Another look at (12b.30) gives 


Uc yit2rd,1+e if 1+2rd6<2- on 


(12b.38) 
He-o1+© if 14 2r6 > 2-0. 


If the first of these alternatives holds, then 
E# Poy E Hit3r6,1te | 


We continue until the conclusion of (12b.20) is achieved. 
Given that wu € C” and that (12b.20) holds, by interpolation we have 
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(12b.39) we [H?-o*, A oP], Ve (0,1), 


@? 
using C’ C H’~%”, Va >0,p < oo. If we take 6 = 1/2 we get 


1 1 1 


Hit/2-0.4 _ 
"  q 2+4+2e 2p 


> 


hence, taking p arbitrarily large, we have 


ne 
lte 


(12b.40) pe ees. Veet te. aS 


Note that this is an improvement of the original hypothesis that u € H1!:?. On the 
other hand, if we take 9 = (1 — 0) /(2 — r), we get 


1—tr 
(12b.41) ue H"4, Va< ; -_ 
an A 
sO 
1—tr 
(12b.42) B(a,u,Vu) € L4, Vq< ; — 
=F 
Hence 
(12b.43) E* B(x,u, Vu) € H*4. 


Meanwhile, by (12b.40), 

(12b.44) BPP ye Briere. 
On the other hand, if we set 

(12b.45) q=ie _ 


which satisfies the condition in (12b.41), we can take 9 © r/(2 +r) in (12b.39) 
and get 


4+r? 
12b.46 Het, Va< 
( ) WE ; a was 
hence 
4 2 
(12b.47) E#P’ye HP, Vp < it are, 


2+9r 
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Note that 


4+ r? 


12b.48 
( ) 2+r 


1 
PSAP TOT Se po 


which is > 2, for any given r € (0,1), if 6 is taken close enough to 1. Now, 
another look at (12b.30) establishes the following special case of (12b.21): 


(12b.49) 122 ie f © LQ) = ue H?”. 


Under the hypotheses that wu € C™ and that (12b.49) holds, we have, parallel to 
(12b.39), 


(12b.50) ué [H*?, H™-*2], V0 (0,1), 


for allo > 0, Q < oo. As before, we can take 0 = 1/(2 — r) and get 


1 


(12b.51) we Ht, Wq< = 2" p. 
=] 


Hence, parallel to (12b.43), and as before using 1 + r/2 < (1—1r/2)/(1—1r), we 
have 


(12b.52) E* B(x, u, Vu) € H2+r/2), 
Similarly, if we take 6 + r/(2 +r) in (12b.50), we get 


442 
(12b.53) ue Het /2)P yc = Be 


+r 


and hence 


E*Pby ce HOCT/2)P p< 


As before, given r € (0,1), we can choose 6 close enough to 1 that p > 2. 
Another look at (12b.30) establishes that 


2 
(12b.54) eae (1 na =) , fe LQ) ue BH”. 


Now we can iterate this argument repeatedly, and since, for all r > 0, we have 
(1+ 7/2)" — co as k — 00, we obtain (12b.21). 
We next want to weaken the requirement of Holder continuity on w. 


Proposition 12B.3. Let u € H'(Q) solve (12b.18). Assume the very strong 
ellipticity condition 


254 14. Nonlinear Elliptic Equations 
(12b.55) alg(@,U)CjaKa = Aol¢|?, Ao > 0. 


Also assume B(x,u, Vu) is a quadratic form in Vu. Assume furthermore that u 
is continuous on Q. Then, locally, if p > n/2, 


(12b.56) fe MP = Vue M3, forsomeq>n. 
Hence u € C", for some r > 0. 

To begin, given x € Q, shrink 2 down to a smaller neighborhood, on which 
(12b.57) |u(a) — uo| < EB, 


for some ug € R™ (if (12b.18) is an M x M system). We will specify EF’ below. 
With the same notation as in (12.22), write 


(12b.58) (Oja!*(x, u) Opu, w) p> = =f (Fu, Vu) dx, 


so Gale u) determines an inner product on T* @R™ for each x € ©, ina fashion 


that depends on u, perhaps, but one has bounds on the set of inner products so 
arising. Now, if we let w € C§°(Q) and w = w(x)?(u — uo), and take the inner 
product of (12b.18) with w, we have 


[iver dx +2 / o(Vu)(Vv)(u— uo) dx 
(12b.59) - [vu — uo) B(x, u, Vu) dx 
= - { Pfu- Ug) dx. 

Hence we obtain the inequality 


fe livur — ju — uo| - |B(a, u, Vu)| — 6°|Vul?] dx 
(12b.60) : 
< Bf iveria— wo det f aPisl- fuel ae, 


for any 6 € (0,1). Now, for some A < oo, we have 
(12b.61) |B(a,u, Vu)| < AlVul?. 
Then we choose F in (12b.57) so that 


(12b.62) PASI a <1; 
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Then take 6? = a/2, and we have 


2 
(12b.63) sf vival? ae < = f [Vol -Ju—uol? de+ f 4?) f|-|u—uo de. 
Now, given x € Q, for R < dist(x, OQ), define U(a, R) as in (12.67) by 


(12b.64) UG R= h- ‘| |u(y) — ue,r|? dy, 


Br(2) 


where, as before, uz pz is the mean value of ul Br(a)" The following result is 


) 
analogous to Lemma 12.11. Let Ag be the constant produced by Lemma 12.12, 


applied to the present case, and pick p such that Agp? < 1/2. 


Lemma 12B.4. Let O CC Q. There exist Ro > 0, 0 < 1, and Co < 00 such 
that ifx € O andr < Ro, then either 


(12b.65) U(x,r) < Cor22-"/P) , 
or 
(12b.66) U(a, pr) < 0U(a,r). 


Proof. If not, there exist z, € O, R, > 0, J) > 1, and u, € H1(0,R™) 
solving (12b.18) such that 


(12b.67) U.,(a,, Ry) = 2 > CoR20-"/?) 
and 
(12b.68) UL(xy, pRv) > 0,UL(ap, R). 


The hypothesis that u is continuous implies ¢, — 0. We want to obtain a contra- 
diction. 
As in (12.81), set 


(12b.69) vy (x) =e, Paes + R,x) - live | 
Then vu, solves 
0,034 (ay + Rix, evur(x) + tive, H,) O,v8 


(12b.70) R? 
+e, B(x, + Ry2,évv,(«) + Ue,,k,,Vvv(2)) = ff. 
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Note that, by the hypothesis (12b.67), 


Re 1 


12b.71 te R, 
( ) Ep . Co - 
Now set 
(12b.72) V_(0,r) =r7” / Jur (y) — ore dy. 
B,.(0) 
Then, as in (12.84), we have 
(12b.73) V_(0,1) = lle llZ2B, (0) =1, V,(0,p) >. 


Passing to a subsequence, we can assume that 


(12b.74) v, + v weakly in L?(B,(0),R™), e,v, 30 ae. in By (0). 


Also, as in (12.87), there is an array of constants he such that 


(12b.75) a! (ay + Rx, eyvy (2) + Ue,,r,) —> big ae. in By (0), 


and this is bounded convergence. 

We next need to estimate the L?-norm of Vv,, which will take just slightly 
more work than it did in (12.88). 

Substituting €,v, ((x—2,)/R_) +Uvx,,R, for up(x) in (12b.63), and replacing 
uo by Uya,,R,» We have 


gf Ole) 


2 
(12b.76) rs = f Re\vur 


for € CE (Br, (ay). Actually, for this new value of uo, the estimate (12b.57) 
might change to |u(x) — uo| < 2E, so at this point we strengthen the hypothesis 
(12b.62) to 


(12b.77) (pAa 1 = qe, 


in order to get (12b.76). Since R2/e, < RP? /Co, we have, for U(x) = W(x, + 
Rix) € CX (Bi(0)), 
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a 2 Ry? 
(12b.78) 5 | wiver dx < = f ivuPieP dz + [Velen dx, 
0 


where F(x) = f(a, + RZ). 
Since ||v,||2(B,(0)) = 1, if V < 1, we have 


1/2 
(12b.79) per -|u,| dx < ( / |F|? du) <O,R;"/? 
Bi (0) 


if f € M2, so we have 
2 C 
(12b.80) 5 | vive? da < ~ f IVeP le? dx + —|[f lace. 
2 a Co 7 


This implies that v, is bounded in H! (B,(0)) for each p < 1. Now, as in (12.89), 
we can pass to a further subsequence and obtain 


Uy —> v strongly in Li,,(B1(0)) 


(12b.81) ie 
Vu, — Vv weakly in Li,,(B1(0)). 


Thus, as in (12.90), we can pass to the limit in (12b.70), to obtain 
(12b.82) 0;b)',0x0° =0 on By (0). 

Also, by (12b.73), 

(12b.83) V(0,1) =llullz2a@) <1, V(O,p) > 1. 


This contradicts Lemma 12.12, which requires V(0, p) < (1/2)V(0, 1). 


Now that we have Lemma 1|2B.4, the proof of Proposition 12B.3 is easily com- 
pleted, by estimates similar to those in (12.69)-(12.73). 
We can combine Propositions 12B.2 and 12B.3 to obtain the following: 


Corollary 12B.5. Letu € H'(Q) NC(Q) solve (12b.18). If the very strong ellip- 
ticity condition (12b.53) holds and B(x, u, Vu) is a quadratic form in Vu, then, 
givenp > n/2, q € (1,00), s > 0, 


(12b.84) fe MPO H*4* => uc W844, 


We mention that there are improvements of Proposition 12B.3, in which the 
hypothesis that wu is continuous is relaxed to the hypothesis that the local oscilla- 
tion of u is sufficiently small (see [HW]). For a number of results in the case when 
the hypothesis (12b.4) is strengthened to 
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|B(a,u,p)| < C{p)", 


for some a < 2, see [Gia]. Extensions of Corollary 12B.5, involving Morrey space 
estimates, can be found in [T2]. 

Corollary 12B.5 implies that any harmonic map (satisfying (12b.17)) is smooth 
wherever it is continuous. An example of a discontinuous harmonic map from R? 
to the unit sphere S? C R? is 


(12b.85) u(x) = 


It has been shown by F. Helein [Hel2] that any harmonic map u: 2 — M froma 
two-dimensional manifold Q into a compact Riemannian manifold M is smooth. 
Here we will give the proof of Helein’s first result of this nature: 


Proposition 12B.6. Let 2 be a two-dimensional Riemannian manifold and let 
(12b.86) u:A— s™ 
be a harmonic map into the standard unit sphere S™ C R™*+. Thenu € C®(Q). 


Proof. We are assuming that u € Hit, (Q), that w satisfies (12b.86), and that the 
components u; of u = (u1,...,Um-41) satisfy 


(12b.87) Au; + u;|Vul? = 


Here, Au; and |Vu|? = 5>|Vuel|? are determined by the Riemannian metric on 
Q, but the property of being a harmonic map is invariant under conformal changes 
in this metric (see Chap. 15, § 2, for more on this), so we may as well take Q to 
be an open set in R?, and A = 0? + 03 the standard Laplace operator. Now 
|u(a)|? = 1 a.e. on Q implies 


mt+1 
(12b.88) Xt (Oju;) i= 1,2, 


and putting this together with (12b.87) gives 


m+1 
(12b.89) Au; =- oy (uj Vu —u,Vuj)Vur, Wi. 
k=1 


On the other hand, a calculation gives 


(12b.90) div(ujVug — unVuj) = S— Oe(ujOeux — uxdeu;) = 0, 
£ 
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for all j and k. Furthermore, since u € H}.(Q) MN L*(Q), 
(12b.91) ujVug — upVu; € L2(Q), Vux € L2,(Q). 
Now Proposition 12.14 of Chap. 13 implies 


(12b.92) So (uj Vr — upVuj) Var = fy © Dioo(Q), 
k 


where $};,,((2) is the local Hardy space, discussed in § 12 of Chap. 13. Also, by 
Corollary 12.12 of Chap. 13, when dim 2 = 2, 


(12b.93) Auz = — Fj © Bigg(Q) => uz € C(O). 
Now that we have u € C(Q), Proposition 12B.6 follows from Corollary 12B.5. 


If dim 2 > 2, there are results on partial regularity for harmonic maps u : 2 > 
M, for energy-minimizing harmonic maps [SU] and for “stationary” harmonic 
maps; see [Ev4] and [Bet]. See also [Si2], for an exposition. On the other hand, 
there is an example due to T. Riviere [Riv] of a harmonic map for which there is 
no partial regularity. 

We mention another system of the type (12b.1), the 3 x 3 system 


(12b.94) Au = 2Hu, X ty on Q, u=g on OQ. 


Here H is a real constant, Q is a bounded open set in R?, and g € C®(, R®). 
We seek u : 2 —+ R®. This equation arises in the study of surfaces in R® of 
constant mean curvature H. In fact, if © C R? is a surface andu: Q > Na 
conformal map (using, e.g., isothermal coordinates) then, by (6.10) and (6.15), 
& has constant mean curvature H if and only if (12b.94) holds. In one approach 
to the analogue of the Plateau problem for surfaces of mean curvature H, the 
problem (12b.94) plays a role parallel to that played by Au = 0 in the study of 
the Plateau problem for minimal surfaces (the H = 0 case) in § 6. For this reason, 
in some articles (12b.94) is called the “equation of prescribed mean curvature,” 
though that term is a bit of a misnomer. 
The equation (12b.94) is satisfied by a critical point of the functional 


1 2 
(12b.95) J(u) = [giver + 5H(u- ts x uty) } dx dy, 
Q 


acting on the space 


(12b.96) V = {u € H'(0,R*) :u=g on AN}. 
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That J is well defined and smooth on V follows from the following estimate of 
Rado: 


1 3 
(12b.97) Vu) — Vg)? < S—(IVullze + [IVollz2)", 


provided u = g on 02, where 


(12b.98) V(u) = ic “Uz X Uy) dx dy. 
Q 

The boundary problem (12b.94) is not solvable for all g, though it is known to 
be solvable provided 
(12b.99) [Z| - |Ig|lz~ <1. 
We refer to [Str1] for a discussion of this and also a treatment of the Plateau prob- 
lem for surfaces of mean curvature H, using (12b.94). Here we merely mention 
that given u € H'(Q, R®), solving (12b.94), the fact that 
(12b.100) u € C(O, R3) 
then follows from Corollary 12.12 and Proposition 12.14 of Chap. 13, just as in 


(12b.93). Hence Corollary 12B.5 is applicable. This result, established by [Wen], 
was an important precursor to Proposition 12.13 of Chap. 13. 


13. Elliptic regularity [V (Krylov—Safonov estimates) 


In this section we obtain estimates for solutions to second-order elliptic equations 
of the form 


(13.1) Lu=f, Lu=a!*(x)0;0,u + 0 (a) Ojut c(a)u, 


on a domain 2 C R”. We assume that a/ ie b), and © are real-valued and that 
al® € L©(Q), with 


(13.2) AEP < a*(a)EjEe < AlED?, 
for certain \, A € (0,00). We define 
(13.3) D = det(a’*), D, =D". 


A. Alexandrov [A]] proved that if |b]/D,. € £"(Q) and c < 0 on Q, then 
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(13.4) ue C(Q)NHZ"(Q), Lu> fonQ, 

implies 

(13.5) sup u(x) < sup ut(y)+Cl|Dr'f\lz(a), 
rE yEedQ 


where C = C(n, diam Q, ||b/D.||1»). We will not make use of this and will not 
include a proof, but we will establish the following result of I. Bakelman [B], 


essentially a more precise version of (13.5) for the special case b? = c = 0 
(under stronger regularity hypotheses on w). It is used in some proofs of (13.5) 
(see [GT]). 


To formulate this result, set 


It = {yeQ: u(x) < uly) +p-(e@—-y), Veen, 


13.6 
oe for some p = p(y) € R"}. 


If u € C1(Q), then y belongs to [+ if and only if the graph of u lies everywhere 
below its tangent plane at (y, u(y)). If uw € C?(Q), then wu is concave on I'*, that 
is, (O;0,u) < OonT*. 


Proposition 13.1. [fu € C?(Q) 9 C(Q), we have 


d -1/ 5k 
(13.7) sup u(x) < — u(y) + ayar lls (a 3jK%)|| party: 


where d= diam Q, and V,, is the volume of the unit ball in R”. 
To establish this, we use the matrix inequality 
i n 
(13.8) (det A) (det B) < (- Tr AB) 
n 


for positive, symmetric, n x n matrices A and B. (See the exercise at the end of 
this section for a proof.) Setting 


(13.9) A=—H(u) =—(dj;Ou(z)), B=(a!*(z)), xe, 
where H(u) is the Hessian matrix, as in (3.7a), we have 
= ae “ + 
(13.10) ldet H(u)| <D (-<2 0;qu) on Pt, 
n 


Thus Proposition 13.1 follows from 


Lemma 13.2. For u € C?(Q) M C(Q), we have 
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d 1/n 
(13.11) sup u(x) < sup u(y) + al \det H(u)| dx) : 
rEQ yeaa Vai” Es 


Proof. Replacing u by u — supgg_u, it suffices to assume u < 0 on 0. Define 
X(@) to be U,,cq X(y)s where 


(13.12) x(y) = {p € R®: u(x) < uly) +p: (x —y), Va € O}, 


so y(y) 404 y EI. Also, if u € C'(Q) (as we assume here), 


(13.13) x(y) ={Du(y)}, fory eT. 

Thus the Lebesgue measure of y(Q) is given by 

(13.14) £"(x(Q)) = £"(x(PT)) = L"(Du(Et)) < i] |det H(w)| dex. 
Tt+ 

Thus it suffices to show that if u € C(Q) 9 C?(Q) and u < 0 on AQ, then 


d 
(13.15) sup u(x) < ne” (x(Q)). 
rE Vn 


This is basically a comparison result. Assume sup wu > 0 is attained at x. Let 
W, be the function on 2 whose graph is the cone with apex at (zo, u(zo)) and 
base OQ. x {0}. Then, if yy, (y) denotes the function (13.12) with u replaced by 
W), we have 


(13.16) Xu(Q) D xw, (2). 


Similarly, if W2 is the function on By(xo) whose graph is the cone with apex at 
(xo, u(2o)) and base {x : |a — ao| = d} x {0}, then 


(13.17) xwi (Q) D xw2 (Ba(20)). 


Finally, the inequality 


d 
(13.18) sup Wo < al" bon (Ba(zo))) 


is elementary, so we have (13.15), and hence Lemma 13.2 is proved. 


We now make the assumption that 
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A || 3 
ae — < =, <a 
(13.19) X Y, ( x ) V, d= VY; 


and establish the following local maximum principle, following [GT]. 


Proposition 13.3. Let u € H?-"(Q), Lu > f, f € L"(Q). Then, for any ball 
B= Bor(y) C Qand any p € (0,n], we have 


1 4 l/p R 
(13.20) 2e u(x) <C ana | ? de) + FIlFl 


L"(B) (> 


where C = C(n,7,vR?,p). 


Proof. Translating and dilating, we can assume without loss of generality that 
0 € Qand B = B,(0). We will also assume that u € C?(Q) MN H?”(Q), since 
if (13.20) is established in this case, the more general case follows by a simple 
approximation argument. 

Given (@ > 1, define 


(13.21) n(x) = (1—|al?)", for |2| <1. 
Setting v = ju on B, we have 


alk 0j;Onv = na)* O;Onu + 2a)* (On) (O,u) +uat® 0;0n 


(13.22) 
> (f — b’ Aju — cu) + 2a?*(0;7)(Opu) + uat® O;OgN. 


Let [7 be as in (13.6), but with wu replaced by v, and Q replaced by B. Clearly, 
u > OonT;t. We have 


(13.23) |Du] < —"— on Tt, 
1— |z| 
sO 
1 
|[Du| = 97 |Du — unl < —(—*_ + ulDnl) 
(13.24) n\1—|z| 


<2(1+ 8)n-/Fu on TS. 
Hence 


—a!* a;qv < { (168? + 28) An-?/9 + 28|b1n- V9 + chu + nf 


(13.25) 
< Cdn 2/8 y + f, 
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ont, where C = O(n, 3,7, v). Of course, a7*0;O,0 < 0 on Tt. If B > 2, we 
have, upon applying Proposition 13.1 to v, 


1 
sup uv < C{ [n/a | ney a 5 lIfllexce) 


(13.26) 
- 1 
<Ci { (sup ut) gad aa aa Pree + x lf llc } 


Choose 3 = 2n/p > 2, so we have 
/n n 
(13.27) sup v < Cy { (sup v*)" Sear ees 5llflincay}. 


(Here we allow p < 1, in which case || - ||z» is not a norm, but (13.27) is still 
valid.) Using the elementary inequality 


(13.28) qi P/ngpln < eq + ew (n/p-1) p, Ve € (0,00), 


and taking a = supg vt, b = |lut||z»(g), and e = 1/2C}, we have (the R = 1 
case of) (13.20), so Proposition 13.3 is proved. 


Replacing u by —u, we have an estimate on supz,,(1) (—u) when Lu < f. 
Thus, when Lu = f and the hypotheses of Proposition 13.3 hold, we have 


L”(B) 
Bry) 


(13.29) Iu <04 ( frac)? + 3 | 
, sup |U)S Vol(B) U XL X ra 
B 
Next we establish a “weak Harnack inequality” of [KrS], which will lead to 
results on Hélder continuity of solutions of Lu = f. This result will also be 
applied directly in the next section, to results on solutions to certain completely 
nonlinear equations. 


Proposition 13.4. Assume u € H?-"(Q), Lu < f inQ, f € £"(Q), andu > 0 
ona ball B = Bor(y) C 9. Then 


1/p 
1 R 
. —— P < i — n 
(13.30) nes ic de} <C(inf w+ Fllfllanw), 


Br 
for some positive p = p(n, y,v.R?) and C = C(n,y,vR?). 


As before, there is no loss of generality in assuming B = B,(0). Also, replac- 
ing L and f by \~!L and \~'f, we can assume \ = 1. 
To begin the proof, take « > 0 and set 
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1 
G=utet|lfilr- (sy, w = log —, 


(13.31) 


v= TW, G= 


? 


Sl [Ss 


where 77 is given by (13.21). Note that w is large (positive) where w is small. We 
have 


—a)* 0;0,0 = —nai* 0;0,w — 2a7*(0;n)(O,w) — wa?® O;Onn 
< n{[—a?* (0;w)(Oxw) + wO;w + |e] + g] 
(13.32) — 2aI*(Ojn)(Opw) — wat*® O;Onn 


2. 
< a (8in)(Oxn) — wal® 8; + (|B)? + lel + 9), 

where the last inequality is obtained via Cauchy’s inequality, applied to the inner 
product (V, W) = Vja/*W,,. 

Now the form of 7 implies that a/* 0;0,n > 0 provided 2(3 — 1)aI* aja, + 
a!) |x|? > a/J, and hence 
(13.33) 26|a|? > nA => a?* 0;0yn > 0. 
Thus, if a € (0,1), then 


(13.34) B= >, el >a ai d;an > 0. 
a 


Hence, on the set Bt = {x € B: w(x) > 0}, we have 


ik = ak 0,8 
—al* OyOyv < AB (1 — |n?)?™ |e? + 0x8, sup — n ") 
oe + (lb? + lel +9) 
2nGA 
<46?A + |b]? + |el+ 94 an OXB a: 


Note that ||g||z»() < 1. Thus Proposition 13.1 yields 


(13.36) sup v < C(1+|luTIIz»(8,)); 
B 


with C = C(n,a,7,v). 

Note that if u satisfies the hypotheses of Proposition 13.4 and t € (0, co), then 
u/t satisfies L(u/t) < f/t, and the analogue of w in (13.31) is w — k, where 
k = log(1/t). The function g in (13.31) is unchanged, and, working through 
(13.32)-(13.36), we obtain the following extension of (13.36): 
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(13.37) sup 7(w—k) < C(1+ |In(w—k)*|lnwy), VEER, 
B 


with constants independent of k. 

The next stage in the proof of Proposition 13.4 will involve a decomposition 
into cubes of the sort used for Calderon—Zygmund estimates in §5 of Chap. 13. 
To set up some notation, given y € R”, R > 0, let Qr(y) denote the open cube 
centered at y, of edge 2R: 


(13.38) Qrly) = {x ER”: |z; -—yj|< R, 1S 7 <n}. 


If a < 1//n, then Qa = Qa(0) CC B. 

The cube decomposition we will use in the proof of Lemma 13.5 below can 
be described in general as follows. Let Qo be a cube in R”, let y > 0 be an ele- 
ment of L'(Qo), and suppose Joo pdx < t£L"(Qo), t € (0,00). Bisecting the 
edges of Qo, we subdivide it into 2” subcubes. Those subcubes that satisfy 
ie Qa? dx < t£"(Q) are similarly subdivided, and this process is repeated 
indefinitely. Let F denote the set of subcubes so obtained that satisfy 


[ew > 4£"(Q); 
Q 


we do not further subdivide these cubes. For each Q € F, denote by Q the sub- 
cube whose subdivision gives Q. Since £"(Q)/L”"(Q) = 2”, we see that 


1 
13.39 i< =o / de< 2", VQeF. 
( ) mO J g Q 


Also, setting F’ = Uocr Q and G = @p \ F,, we have 
(13.40) p<t, ae.in G. 


This subdivision was also done in the proof of Lemma 5.5 in Chap. 13. Let us 
also set F = Uger Q; since Q € F > Q € F, we have 


(13.41) fe dx < tL" (F). 


F 


In particular, when ¢ is the characteristic function yp of a measurable subset I of 
Qo, of measure < t - £"(Qo), we deduce from (13.40)-(13.41) that 


(13.42) OT =2"0n Fy each), 
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We have the following measure-theoretic result: 


Lemma 13.5. Let Qo be a cube in R”, w € E*(Qo), and, fork € R, set 
(13.43) Ty = {@ € Qo: w(2) < k}. 
Suppose there are positive constants 0 < 1 and C such that 


(13.44) sup (w—k)<C 
Q0NQsr (z) 


whenever k and Q = Q,(z) C Qo satisfy 
(13.45) LL. AQ) > 6L"(Q). 


Then, for all k € R, 


(13.46) sup (w—k)<C 


Qo 


( | eee) 
log 6 


Proof. We show by induction that 


(13.47) sup (w — k) < mC, 
Qo 


for any m € Z* and k € R such that £L"(T,) > dL" (Qo). This is true by 
hypothesis if m = 1. Suppose that it holds form = M € Z* and that £"(T,) > 
6!@+1¢"(Qo). Define I, by 


(13.48) Pe =U {Q3r(z) 1 Qo : L"(Qr(z) AT) = 5 L"(Qr(z)) 
Applying the estimate (13.42), with t = 6, we see that either Ty = Qp or 
(13.49) L°(T,) > 61 L7(T) = 5” vol(Qo), 

and hence, replacing & by k + C, we obtain 


(13.50) sup(w —k) < (M+1)C, 
Qo 


which verifies (13.47) form = M +1. 
Now, the estimate (13.46) follows by choosing m appropriately, and the lemma 
is proved. 


Returning to the estimation of the functions defined in (13.31), we see that 
(13.36) implies 
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(13.51) sup v <C(1+ [lo Ilzm(q.)) < C(I + [vol(Qz)]” sup v*), 
B B 


where Qa = Q..(0), as stated below (13.38), and 
Qt = {x € Qo: u(x) > 0} = {2 € Qa: U2) < 1}. 


Hence, if C is the constant in (13.36), 


(13.52) 


woW'Qi) < 


= 5 < * 
vol(Qa) — eG) a= ee vs2c 


Now choose a = 1/3n, and take 6 = (4aC’)~, as in (13.52). Using the coor- 
dinate change x +> a(x — z)/r, we obtain for any cube Q = Q,(z) such that 
Bsnr(z) C B, the implication 


+ 
vol(Q ip ass, sup w<C(n,7,v). 


os vol(Q) Qsr(2) 


With a and @ as specified above, take 6 = 1 — 0, Qo = Qa(0), and note that 
the estimate (13.53) holds also when w is replaced by w — k, and Q* is replaced 
by the set {x € Q: w(x) — k > O}, as a consequence of (13.37). Let 


(13.54) u(t) = L"({x € Qo : U(x) > t}). 
Setting k = log 1/t, we have from Lemma 13.5 the estimate 


(13.55) ply = C(inf tu), VESs, 


where C = C(n,7,v), & = «(n,7,v). Replacing the cube Qo by the inscribed 
ball By (0), a = 1/3n, and using the identity 


(13.56) fo dx = rf tP-1y(t) dt, 
Qo ° 
we have 
(13.57) [or ac < Clint 0)", torp= §. 
r : 


The inequality (13.30) then follows by letting « —> 0 if we use a covering 

argument to extend (13.57) to arbitrary a < 1 (especially, a = 1/2) and use the 

coordinate transformation x +> (a—y)/2R. Thus Proposition 13.4 is established. 
Putting together (13.29) and (13.30), we have the following. 
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Corollary 13.6. Assume u € H?-"(Q), Lu = f onQ, f € L™(Q), and u > 0 
ona ball B = Bar(y) C 9. Then 


R 
13.58 sup u(#)<C i( inf w+ — ee : 
(13.58) gup ule) SCi(, inf wt 5 Ifllencosny) 


for some C, = C,(n,7,vR?). In particular, if u > 0 on Q, 


(13.59) Lu=0= > sup u(x) <C, inf u(z). 
Briy) Bor(y) 


We can use this to establish Hélder estimates on solutions to Lu = f. We will 
actually apply Corollary 13.6 to Ly; = aJ* 0,0, + 00;, so Lyu = fi; = f — cu. 
Suppose that 


(13.60) a= inf u< sup u=b. 
Bar(y) Bar(y) 


Then v = (u—a)/(b—a) is > 0 on Byr(y), and Liv = f,/(b—a), so Corollary 
13.6 yields 


Uu-a u-a R 


1 
13.61) s a" <C1(_ inf ! : ); 
(13.61) OD aa = ey =a Xba gf ~ cullen wan) 


Without loss of generality, we can assume C, > 1. Now given this, one of the 
following two cases must hold: 


U-—a 1 U-—a 
i C, inf >. su j 
@) Bath b-a eee b-—a 


U-a 1 U-a 
il C, inf <- su ; 
( ) Bata) b-—a 2 ae b-—a 


If case (1) holds, then either 


u-a 1 


inf j 
Bor(y) b—a ~ ACY 


and hence (since we are assuming C; > 1) 


1 
13.62 i) => ose u<{1l—-—— osc wu. 
® Bry) ( a) Bar(y) 


If case (11) holds, then 
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su < : : Il f ull 
ue, b-a” Ab-a CUllL™ (Bar)? 
SO 
(13.63) Gi) => ose u< on || f — cul 
‘ U U n 
Bry) r CULL” (Ban) 


which is bounded by C2 R in view of the sup-norm estimate (13.29). Consequently, 
under the hypotheses of Corollary 13.6, we have 


i 
(13.64) osc u< max(C2R, (1 se =) osc u), 
Br(y) 17 Bar(y) 


with C, = Ci(n,7,vR3), C2 = Co(n,y,v RB) [If ll ceca) + |lulle» a], given 
Baro (y) CQ, R < Ro. Therefore, we have the following: 


Theorem 13.7. Assume u€ H?"(Q), Lu= f, and f € L"(Q). Given O CCQ, 
there is a positive 1 = p(O,Q,n,¥, v) such that 


(13.65) lellou(o) S$ A(llullz»(ay + IIfllz»(ay), 
with A= A(O,Q,n,7, Vv). 


Some boundary regularity results follow fairly easily from the methods devel- 
oped above. For the present, assume (2 is a smoothly bounded region in R”, that 


2,n re) 
(13.66) we H"O)NC]), uso <9, 


and that Lu = f on Q. Let B = Bop(y) be a ball centered at y € OQ. Then, 
extending (13.20), we have, for any p € (0, n], 


1 1/p R 
13.67 <C +)P q R ; 
eae = Cae | (u") x) + x fll (BNQ) 7 > 


with C = C(n,7,vR?,p). To establish this, extend u to be 0 on B \ Q. This 
extended function might not belong to H?*"(B), but the proof of Proposition 13.3 
can still be seen to apply, given the following observation: 


Lemma 13.8. Assume that u satisfies the hypotheses of Proposition 13.1 and that 
Q CQ, and set u = 0 on Q\ Q. Then 


d 
(13.68) sup u<sup u+ a 
a = nv, /n 


Q 0Q. 


[Dea 85044) || in gy 
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where d = diam Q, and I+ is the upper contact set of u, defined as in (13.6), 
with Q, replaced by Q. 


Note that if u(a) > 0 anywhere on Q, then P+ CIt, 
The following result extends Proposition 13.4. 


Proposition 13.9. Assume u € H?"(Q), Lu= f onQ, u>00n BOQ. Set 
(13.69) m= inf u, 
BNaQ 


and 


u(x) =min(m,u(z)), «ce BNQ, 


(13.70) ne BVO. 
Then 
1/p 
(13.71) IED fo dx < C( inf ut llfllancane)); 
Br 


for some positive p = p(n, y,v.R?) and C = C(n,y,vR?). 


Proof. One adapts the proof of Proposition 13.4, with u replaced by w. One gets 
an estimate of the form (13.53), with w replaced by w — k, k > —logm. From 
there, one gets an estimate of the form (13.55), for 0 < t < m. But y(t) = 0 for 
t > m, so (13.71) follows as before. 


This leads as before to a Holder estimate: 


Proposition 13.10. Assume u € H?"(Q), Lu = f € L"(Q), ul, = 9 € 
C® (AQ), and 3 > 0. Then there is a positive = p(Q,n,y, Vv, 3) such that 


(13.72) lull ou @) < A( lull ix (o) + |lfllze(ay + Iellee(en): 
with A= A(Q,n,7¥,¥, 3). 

We next establish another type of boundary estimate, which will also be very 
useful in applications in the following sections. The following result is due to 
[Kry2]; we follow the exposition in [Kaz] of a proof of L. Caffarelli. 
Proposition 13.11. Assume u € C?(Q) satisfies 


(13.73) Lu =f, 0. 


tl on = 
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Assume 

(13.74) Iz4|| n-o(ay + |Vullt~cay + If ilze(ay < K- 

Then there is a Hélder estimate for the normal derivative of u on OQ: 
(13,75) |Qvullca(an) < CK, 


for some positive a = a(Q,n,v,A, A, K) and C = C(Q,n, v, A, A). 


To prove this, we can flatten out a portion of the boundary. After having done 
so, absorb the terms b’(x)0;u + c(x)u into f. It suffices to assume that 


(13.76) Lu=f on Bt, Lu=a!*(x) 0;O,u, 


where 
Bt ={2 ER": |z| <4, an > 0}, 


and that 

(13.77) u=0 on Y={xeER”: |2| < 4, x, =O}, 

and to show that there is an estimate 

(13.78) |Onullcen) < CK, C=C(n,A,A), 

where K is as in (13.74), with 0 replaced by BT, a = a(n, A, A, K) > 0, and 
(13.79) T={xeD:|z| <1}. 


Note that, for (x’,0) € 5, 


(13.80) On u(x’, 0) = v(2’, 0), 
where 
(13.81) v(x) = 2, ‘u(z). 


Let us fix some notation. Given R < 1 and 6 = A/9nA < 1/2, let 


Q(R) = {a € Bt : |a'| < BR, 0< an < OR}, 


13.82 
: Qt(R) = {x € Q(R): SOR < ty < 5R} 


(see Fig. 13.1). Then set 
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x 


Q(R/2) 


FIGURE 13.1 Setup for Boundary Estimate 


(13.83) mpr= inf v, Mr= sup v, 
Q(R) Q(R) 
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SO OSCg(R) UV = Mr — mr. Before proving Proposition 13.11, we establish two 


lemmas. 


Lemma 13.12. Under the hypotheses (13.76) and (13.77), ifalsou > 0 on Q(R), 


then 


2 R 
v<= inf v+— sup |f|. 


13.84 
( ) a) Q(R/2) r 


inf 
Qt(R) 


Proof. Let y = inf{v(x) : |2’| < R, x, = OR}, and set 


2 
(13.85) z(x) = yrn, (6 - a? + 


1 
R 


1 
ttn) sen (OR Te) sup | f]. 
Given 6 € (0, 1/2], we have the following behavior on 0Q(R): 


Z(o\=0, tree (2,0), (bottom), 
(13.86) z(z) <0 on {x € Q(R): |a’| = R}, (side), 
2(x) <276°R<yd5R on {x € Q(R): an = 5R} (top). 


Also, 


a 
(13.87) Lz<-—sup |f|<f on Q(R) if 6 = —. 
9nA 
Since u > 0 on Q(R) and u = xv > YOR on the top of Q(R), we have 


(13.88) L(u—z)>0 on Q(R), uz on 0Q(R). 


Thus, by the maximum principle, u > z on Q(R), so u(z) > z(x)/xn on Q(R). 


Hence 
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6 R 
13. inf >=-(yvy- — ; 
(13.89) pitty 82 5(7 + sup |) 


Since y > infg+(r) v, this yields (13.84). 


Lemma 13.13. If u satisfies (13.76) and (13.77) and u > 0 on Q(2R), then 


(13.90) sii v<c( inf v+R su ), 
fae ones P I fl 


with C = C(n,d, A, K). 


Proof. By (13.58), ifx € Qt(R), r = 6R/8, we have 


(13.91) sup u< c( inf utr? sup Ifl)- 
B,(a) B, (2) 


Since 6R/2 < x, < 5R on QT(R), (13.90) follows from this plus a simple 
covering argument. 


We now prove Proposition 13.11. The various factors C;; will all have the form 


C; = C;(n, A, A, K). If we apply (13.90), with u replaced by u — marzp > 0, 
on Q(2R), we obtain 


13.92 : —_ <C inf _ Rs ; 
(13.92) sup (oman) < 1( inf, (e— man) +B sup |fl) 


By Lemma 13.12, this is 


<C2(_ inf v-—m +Rsu ) 

(13.93) 2 Ba 2R) p If| 

— C2(mrj2 —meor+R sup lf). 

Reasoning similarly, with u replaced by Morxz, — u > 0 on Q(2R), we have 


(13.94) eon —v) < C2(Mor — Mprjo+R sup |f\). 
Qt(R 


Summing these two inequalities yields 
(13.95) Mor —™MerR < C3 [(Mor = mp) = (Mr/2 ma mR/2) +R sup fl], 
which implies 


(13.96) osc u<v osc v+R sup |f|, 


Q(R/2) Q(2R) 
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with 0 = 1—1/Cs3 < 1. This readily implies the Hélder estimate (13.78), proving 
Proposition 13.11. 


Exercises 
1. Prove the matrix inequality (13.8). (Hint: Set C = Al? > 0 and reduce (13.8) to 
(13.97) - Tr X > (det X)'/”, 
for X = CBC > 0. This is equivalent to the inequality 
(13.98) ZA test An) 2 Ones An)™, Ay > 0, 


which is called the arithmetic-geometric mean inequality. It can be deduced from the 
facts that log x is concave and that any concave function y satisfies 


(13.99) p(=0 eee An)) > ~[p(™) +++ + p(An)]-) 
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In this section we derive H6lder estimates on the second derivatives of real-valued 
solutions to nonlinear PDE of the form 


(14.1) F(a, Du) = 0, 


satisfying the following conditions. First we require uniform strong ellipticity: 
strongly 


(14.2) AEP < 06, F(a, u, Vu, Ou)EjEx < AlEl?, 


with \, A € (0,00), constants. Next, we require that F' be a concave function 
of ¢: 


(14.3) O¢ jn 8Cem F (£; U, P, CE jREem < 0, Fak = kis 


provided ¢ = 0?u(x), p = Vu(2). 
As an example, consider 


(14.4) F(x,u,p,¢) = log det ¢ — f(x, u,p). 


Then (D-F)= = Tr(¢~'), so the quantity (14.3) is equal to 
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(145) 0 -Te(C*SC*E) = — THC PEC HEC" M?), B=, 


provided the real, symmetric, n x n matrix ¢ is positive-definite, and ¢~!/? is the 
positive-definite square root of ¢~!. Then the function (14.4) satisfies (14.3), on 
the region where ¢ is positive-definite. It also satisfies (14.2) for 0?u(x) = ¢ € K, 
any compact set of positive-definite, real, n x n matrices. In particular, if F is a 
bounded set in C?({) such that (0;0,u) is positive-definite for each u € F, and 
(14.1) holds, with | f(a, u, Vu)| < Co, then (14.2) holds, uniformly for u € F. 

We first establish interior estimates on solutions to (14.1). We will make use of 
results of § 13 to establish these estimates, following [Ev], with simplifications of 
[GT]. To begin, let 1 € R” be a unit vector and apply O,, to (14.1), to get 


(14.6) Fe, Oj0j; 0, + Fy, OjO,U + Fy OnU + ue On, F = 0. 
Then apply 0, again, to obtain 


Fe,, Gi0j0;,U + (I¢.; Wve F) (G19; Iu) (POO) 


(14.7) ij 7 . 
+Aj} (x, Deu) 0,0;0,u + B(x, D°u) = 0, 


where 
Ave D?u) — 2(0¢;; Op. F) (OnO.U) a 2(0¢,; OuF)(Ouu) + 21" (Oni On.F); 


and B,,(x, D?u) also involves first- and second-order derivatives of F’. 
Given the concavity of F’, we have the differential inequality 


(14.8) Fy,, 0,0;04u > —AY 0,0;0,u — By, 


where A? = A¥J(a, D*u), B, = B,(a, D?u). If we set 


(14.9) i = (1+ Siu ) M =sup |6?u| 
OS a On ae 
then (14.8) implies 
C 3 
(14.10) —Fe,, i0jhy < =i (Ao|0?u| + Bo), 
where 
(14.11) Ao = Ao(llullez@), Bo = Bo(lullea@): 


Now let {44 : 1 < k < N} be acollection of unit vectors, and set 
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N 


(14.12) lig = Pigs j= > hy. 
k=1 


Use hy in (14.10), multiply this by hy, and sum over k, to obtain 


N 


1 C 
14.1 Fe,,(O:he)(Ojhe) — =Fe,, :0;0 < Ao|0°u| + Bo). 
(14.13) », Gy (Oilx) (jhe) — 5 Fey BOjv S pz (Aol Ful + Bo) 
Make sure that {uz, : 1 < k < N} contains the set 
(14.14) = f{ej:1<j<npu{2 “(ete :1<i<j<n}, 
where {e,} is the standard basis of R”. Consequently, 
N 
(14.15) |u|? = S— |A;0;deul? < 4(1 + M)? S~ |Oh,?. 
ig 0 k=1 
The ellipticity condition (14.2) implies 
N N 
(14.16) S> Fe; (Oikx)(Ojhn) => AS— |Ohe?. 
k=1 k=1 


Now, take ¢ € (0,1), and set 
(14.17) we = he + ev. 


We have 


N ; 1 x 2 1/2 Bo 
(14.18) eAy> |Dhx.| = aie OjOjwR <C Ao(>> |Ohx| ) + iat” 
=. k=1 


Thus, by Cauchy’s inequality, 


Cn (Az Bo 
14.19 00S Sa, a= *( 0 4 ve 
al) gE oy Ie Te 
We now prepare to apply Proposition 13.4. Let Br C Bar be concentric balls 
in 2), and set 
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Wrs =sup we, Mes =sup hey, mes = inf hg, 
Bsr Bsr Bsr 


(14.20) N N 
w(sR) = >> ose hy = d| (Mrs — Mke). 


Applying Proposition 13.4 to (14.19), we have 


1/p 


1 
a42y [| (Wie wx)? de) < C(Wia — Win +08), 
Br 


where p = p(n, A/A) > 0, C = C(n, A/A). Denote the left side of (14.21) by 


Oy, R(Wro = Wr). 


Note that 
Wo — we > Mae — he — 2ew(2R 
nie ne a > “~ — ia + a 
Hence 
(14.23) ®,. 2(Mi2 — he) < C{ Mio — Mii + ew(2R) + GR’}. 
Consequently, 


®p,n(Yo(Mi2 2 hn)) < NVPS © Oy (Mp2 — he) 
(14.24) k k 


< {(1 + €)w(2R) — w(R) + ZR’}. 


We want a complementary estimate on ®, r(he — mez). We exploit the con- 
cavity of F in ¢ again to obtain 


Bes (y, D*u(y)) (0;0;u(y) = 0,0; u(x)) 


ans < F(y, Du(y),0u(e)) — F(y, Du(y), Puy) 
= F(y, Du(y), 0’u(x)) — F(x, Du(x), 0? u(z)) 
ss Do|x _ yl; 

where 


(14.26) Do = Do(llullo2@))- 
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The equality in (14.25) follows from F(x, D?u) = 0. At this point, we impose a 
special condition on the unit vectors j4z used to define hy, above. The following is 
a result of [MW]: 


Lemma 14.1. Given 0 < \ < A < o, let S(\, A) denote the set of positive- 
definite, real, n x n matrices with spectrum in |X, A]. Then there exist N € Z* 
and \* < A* in (0,00), depending only on n,X, and A, and unit vectors pun, € 
R", 1<k<N, such that 


(14.27) {pi 1<k<N}DY, 


where \ is defined by (14.14), and such that every A € S(X, A) can be written in 
the form 


N 
(14.28) A=) BrPanr Be © DA), 
k=1 


where P,,, is the orthogonal projection of IR" onto the linear span of [tr. 


Proof. Let the set of real, symmetric, n x n matrices be denoted as Symm(n) 
a R™"+1/2_ Note that A € Symm(n) belongs to S(A, A) if and only if 


Alu? < v- Av < Alv|?, Vu eR”. 


Thus S(A, A) is seen to be a compact, convex subset of Symm(n). Also, S(A, A) 
is contained in the interior of S(A1, Ay) if0 < A. <A<A< Aj. 

It suffices to prove the lemma in the case A = 1/2n. Suppose 0 < \ < 1/2n. 
By the spectral theorem for elements of Symm(n), S(A/2,1/2n) is contained in 
the interior of the convex hull CH(P) of the set 


P={O}U{P,: we S*™* CR}. 


Thus, there exists a finite subset 21 D L of unit vectors such that S(\/2,1/2n) is 
contained in the interior of CH (Po), with Po = {0} U{P, : w € Wh. Write 2 
as {un : 1 <k < N}. Then any element of S(\/2, 1/2n) has a representation of 
the form ee BrrPag» with 3, € [0, 1]. 

Now, if we take A € S(A, 1/2n), it follows that 


N 
i » syn ° s(5. a) 


1 


so A = yp, (Ge + A/2N)P,, has the form (14.28), with G, = 8, + A/2N € 
[A/2N, 2]. This proves the lemma. 
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If we choose the set {uu : 1 < k < N} of unit vectors to satisfy the condition 
of Lemma 14.1, then 


Fe, (y, D?u(y)) (0,0;u(y) = 0,0; u(x)) 


N 
(14.29) - » Br(y) (Oi, u(y) — 2, u(a)) 


N 
= 2(1+ M)S~ Bx(y) (he(y) — he(x)), 


k=1 


with 6,(y) € [A*, A*]. Consequently, for 2 € Bor, y € Br, we have from 
(14.25) that 


N 

(14.30) S> Be(y) (Rely) — ha(x)) < OAMR, fi = 
k=1 

Hence, for any @ € {1,..., N}, 


he(y) — mez < 52 {CMR + A* J (Mrz — he())} 
(14.31) hes 
< C{GR + S>(Mrz — hely)) }, 


k#Xe 


where C = C(n, A/X). We can use (14.24) to estimate the right side of (14.31), 
obtaining 


(14.32) ®, r(he — mez) < C{(1+¢)w(2R) —w(R)+f#R+ER?}. 


Setting € = k, adding (14.32) to (14.23), and then summing over k, we obtain 


(14.33) w(2R) < C{(1+ e)w(2R) —w(R)+fR+ ER}, 
and hence 
(14.34) w(R) < (1- 4 +¢)w(2R) + (iR+ ZR?) 


Now C is independent of ¢, though 7 is not. Thus fix ¢ = 1/2C, to obtain 
1 = 
(14.35) w(R) < (1- 5c) w(2R) + (iR+7R?). 


From this it follows that if Bgrz, C Q and R < Ro, we have 
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R 
Ro 


(14.36) osc Pu < C( ya + M) (1+ jiRo + @R2), 


where C' and a are positive constants depending only on n and A/X. We have 
proved the following interior estimate: 

Proposition 14.2. Let u € C*(Q) satisfy (14.1), and assume that (14.2) and 
(14.3) hold. Then, for any O CC Q, there is an estimate 


(14.37) 07 ull ca(o) S CLO mA A, |F lez, Ilullc2@y): 


In fact, examining the derivation of (14.36), we can specify the dependence on 
O,. If O is a ball, and |x — y| > p forall x € O, y € OO, then 


(14.38) |Pullcaw@) < C(n,r,A, ||Flloz,lullex@)e*: 


We now tackle global estimates on 2 for solutions to the Dirichlet problem for 
(14.1). We first obtain estimates for Ol ies 


Lemma 14.3. Under the hypotheses of Proposition 14.2, if ul an = & there is an 
estimate 


(14.39) |Pullce(any < C(Q,n, A, A, ||F llo2, [lulle2@); ll¥llos(aay)- 


Proof. Let Y = b‘(x)0, be a smooth vector field tangent to 0Q, and consider 
v = Yu, which solves the boundary problem 


(14.40) Fc,,0:0;0 = G(a), =Yy, 


Ul 6a 


where 


(14.41) G(x) = 2F¢,,(:B°) (jeu) + Fe, (Ai0;b°)Oeu 
+ Fy, (O:b°) (Ogu) — Fy,00 — Fyv — 00», F. 

The hypotheses give a bound on ||G'||,~(q) in terms of the right side of (14.39). 

If & € C?(Q) denotes an extension of Yy from OQ to Q, then Proposition 13.11, 

applied to v — a, yields an estimate 


(14.42) |.Yullce(any SC, 


where C is of the form (14.39). On the other hand, the ellipticity of (14.1) allows 


2 . eae . . 
one to solve for ul gq in terms of quantities estimated in (14.42), plus ul gq and 


Vu aq? and second-order tangential derivatives of u, so (14.39) is proved. 


We now want to estimate |0u(x) —02u(zo)|, given xp € OO, 4 € 0, 7 € R” 
a unit vector. For simplicity, we will strengthen the concavity hypothesis (14.3) to 
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strong concavity: 
(14.43) 0606 FG 6, O24 =A ey, == 
for some \y > 0, when ¢ = 0?u, p = Vu. Then we can improve (14.8) to 
(14.44) Fe, 0;0;(02u) < —AY 0,0;0,u — By — ro|6?,u]? < -Ci, 
by Cauchy’s inequality, where 

Cy =Ch (n, A, A, Ao, ||Ay(2, D*u)\|L~, ||B,(a, D?u)|| p<). 
Now the function 


(14.45) W(a) = Cola —a0|* (0<a<1) 


is concave on R” \ {xo}, and if Co is sufficiently large, compared to C{ - 
diam(Q)?~°/2, we have 


(14.46) IW<-Ci, Lv= Fy,, 0;v. 

Hence, by the maximum principle, 

(14.47) Ku < F (ao) +W on 00 => Bu < Ku(ao) +W on Q. 
Now the estimate (14.39) implies that the hypothesis of (14.47) is satisfied, pro- 
vided that also C2 > ||0?u||ca(aq), so we have the one-sided estimate given by 
the conclusion of (14.47). 


For the reverse estimate, use (14.25), with y = xo, together with (14.29), to 
write 


N 
(14.48) S~ Bx (xo) (0%, u(x0) — O7,u(ax)) < Do|x — ao]. 
k=1 


Recall that 3;(20) € [A*, A*], A* > 0. This together with (14.47) implies 
(14.49) |07,,,u(x) — 07, u(xo)| < Cl — x0|%, 


with C’3 of the form (14.39), and we can express any 0; 0pu as a linear combination 
of the OF. u, to obtain the following: 


Lemma 14.4. [f we have the hypotheses of Lemma 14.3, and we also assume 
(14.43), then there is an estimate 


(14.50) \07u(x) — 67u(axo)| < Cla —ao|%, 20 € ON, x EQ, 
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with 
(14.51) C = C(Q,n,4,A, Xo, ||Fllc, lullo2@); lellcs(any): 
We now put (14.38) and (14.50) together to obtain a Hilder estimate for 0?u 


on 2. To estimate |0?u(x) — 0? u(y)|, given x, y € ©, suppose dist(x, dQ) + 
dist(y, OQ) = 2p, and consider two cases: 


@) |e—y| <p’, 
Gi) ja — y| > p?. 


In case (i), we can use (14.38) to deduce that 
(14.52) |A?u(x) — O?u(y)| < Cla — y|*p~* < Cla — y|%/?. 


In case (ii), let x’ € OQ minimize the distance from x to OQ), and let y’ € OO 
minimize the distance from y to OQ. Thus 


anes, jz —2'|<2p<2\e-yl'/?, yy! < 2p < 2I\x—yl'”?, 
jz’ —y'| <|a—y|+|2’ —2|+|y! -y| < |e -—y| +4|x —y|'”. 
Thus 
|?u(x) — Ou(y)| < |O?u(x) — Ou(a’)| + |0?u(a’) — d?u(y’)| 
“asa + |Pu(y’) — Pu(y)| 


< Ola — 2!|* + Ola! — y'|* + Cly’ — y|* 
< Cla — y|%/?. 


In (14.52) and (14.54), C has the form (14.51). Taking r = a/2, we have the 
following global estimate: 


Proposition 14.5. Let u € C*(Q) satisfy (14.1), with til a = y. Assume the 


ellipticity hypothesis (14.2) and the strong concavity hypothesis (14.43). Then 
there is an estimate 


(14.55) |lullo2tn@ S C(O, 7, A, A, Ao, ll Fllc2, lullo2@); lllles(oa)), 
for some r > 0, depending on the same quantities as C. 


Now that we have this estimate, the continuity method yields the following 
existence result. For 7 € [0, 1], consider a family of boundary problems 


(14.56) F,(x,D?u) =0 on Q, u an = Yr: 
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Assume F’, and wy, are smooth in all variables, including 7. Also, assume that 
the ellipticity condition (14.2) and the strong concavity condition (14.43) hold, 
uniformly in 7, for any smooth solution u,. 


Theorem 14.6. Assume there is a uniform bound in C?(Q) for any solution u; € 


Ce (Q) of (14.56). Also assume that 0,,F°, < 0. Then, if (14.56) has a solution in 
C™(Q) for rT = 0, it has a smooth solution for T = 1. 


With some more work, one can replace the strong concavity hypothesis (14.43) 
by (14.3); see [CKNS]. 

There is an interesting class of elliptic PDE, known as Bellman equations, 
for which F(x, u,p,¢) is concave but not strongly concave in ¢, and also it is 
Lipschitz but not C'°° in its arguments; see [Ev2] for an analysis. 

Verifying the hypothesis in Theorem 14.6 that u, is bounded in C?(Q) can be 
a nontrivial task. We will tackle this, for Monge—Ampere equations, in the next 
section. 


Exercises 
1. Discuss the Dirichlet problem for 
Au+ Out 5(l + (Au)?)*/? =oe", 


fora > 0. 


15. Monge—Ampere equations 
Here we look at equations of Monge—Ampere type: 
(15.1) det H(u) — F(z,u, Vu) =O00nQ, w= yondQ, 


where {2 is a smoothly bounded domain in R”, which we will assume to be 
strongly convex. As in (3.7a), H(u) = (0;0,u) is the Hessian matrix. We assume 
F(a,u, Vu) > 0, say F(a,u, Vu) = exp f(x,u, Vu), and look for a convex 
solution to (15.1). It is convenient to set 


(15.2) G(u) = log det H(u) — f(z, u, Vu), 
so (15.1) is equivalent to G(w) = 0 on Q, u = v on OQ. Note that 
(15.3) DG(u)v = g?* 0;0xv — (Op, f) (a, u, Vu) Ojv — (Ouf) (a, u, Vu)a, 


where (g/*) is the inverse matrix of (0;0;u), which we will also denote as (gx). 
We will assume 
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(15.4) (Ouf)(x, u,p) 2 0, 


this hypothesis being equivalent to (0,,F')(x, u, p) > 0. 
The hypotheses made above do not suffice to guarantee that (15.1) has a 
solution. Consider the following example: 


(15.5) det H(u) — K(1+|Vul?)* =00nQ, u=0ondQ, 


where 2. is a domain in R?. Compare with (3.41). Let K be a positive constant. If 
there is a convex solution u, the surface © = {(x,u(x)) : « € Q} is a surface in 
IR? with Gauss curvature K. If Q is convex, then the Gauss map N : © + S$? is 
one-to-one and the image N(%) has area equal to K - Area(Q.). But V(X) must be 
contained in a hemisphere of S?, so we must have K - Area(Q2) < 27. We deduce 
that if kK - Area(Q) > 27, then (15.5) has no solution. 

To avoid this obstruction to existence, we hypothesize that there exists uw? € 
C%(Q), which is convex and satisfies 


(15.6) log det H(u?) — f(a,u°, Vu?) >0o0nQ, u? = yonan. 


We call uw? a lower solution to (15.1). Note that the first part of (15.6) is equivalent 
to det H(u?) > F(a, u°, Vu"). In such a case, we will use the method of conti- 
nuity and seek a convex u, € C™(Q)) solving 


log det H(u,) — f(a, us, Vue) 
(15.7) = (1—<a)[log det H(u’) — f(x, u?, Vu")| 
= (1—)h(z), 


foro € [0,1] and u, = y on OQ. Note that up = u? solves (15.7) for o = 0. If 
such wu, exists for all o € [0,1], then u = wy is the desired solution to (15.1). 

Let J be the largest interval in [0, 1], containing 0, such that (15.7) has a convex 
solution u, € C°% (Q) for all o € J. Since the linear operator in (15.3) is elliptic 
and invertible (by the maximum principle) under the hypothesis (15.4), the same 
sort of argument used in the proof of Lemma 10.1 shows that J is open, and the 
real work is to show that J is closed. In this case, we need to obtain bounds on 
Ug in C?+#(Q), for some ju > 0, in order to apply the regularity theory of § 8 and 
conclude that J is closed. 


Lemma 15.1. Given o < rt € J, we have 
(15.8) uw <ug <u, on. 


Proof. The operator G(u) satisfies the hypotheses of Proposition 10.8; since u? = 


Ug = u, on OO, (15.8) follows. 
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In particular, taking o = T, we have uniqueness of the solution uz € C®(Q) 
to (15.7). 

Next we record some estimates that are simple consequences of convexity 
alone: 


Lemma 15.2. Assume Q is convex. For any o € J, 


(15.9) Ug <sup y onQD 
0Q 
and 
(15.10) sup |Vu,(x)| < sup |Vuo(y)]- 
rEQ yEOQ 


Thus we will have a bound on u, in C'(Q) if we bound Vu, on OQ. Since 
Us| an = FE C™(0Q), it remains to bound the normal derivative 0,u, on 00. 
Assume 0, points out of 2. Then (15.8) implies 


(15.11) Ovte(y) < wy), Vy € an. 
On the other hand, a lower bound on 0,u,(y) follows from convexity alone. In 


fact, if v(y) is the outward normal to 02 at y, say y = y — £(y)v(y) is the other 
point in OC through which the normal line passes. Then convexity of u, implies 


(15.12) Us (sy + (1 — 8)¥) < sie(y) + (1-8) (9), 
for 0 < s < 1. Noting that ¢(y) = |y — y|, we have 


e(y) — ely) 


OpUs(y) = = 
ly — y| 


Thus we have the next result: 
Lemma 15.3. If is convex, then, for any a € J, 
(15.13) sup |Vug| < Lip'(y) + sup |Vu"|. 
a e) 
Here, Lip'(y) denotes the Lipschitz constant of y: 


(15.14) Lip'(y) = sup 


We now look for C?-bounds on solutions to (15.7). For notational simplicity, 
we write (15.7) as 
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(15.15) log det H(u) — f(z,u,Vu)=0, ula =¢, 
where the second term on the left is 
fo(z,u, Vu) = f(z,u, Vu) + (1—o)A(2), 


and we drop the o. By (15.4) and (15.6), we have f(z,u,p) > O and 
(Ouf) (2, u,p) < 0. 


Since wu is convex, it suffices to estimate pure second derivatives Fu from 
above. Following [CNS], who followed [LiP2], we make use of the function 


2 
w = flv /2 Ou, 


where ( is a constant that will be chosen later. Suppose this is maximized, among 
all unit y € R”, x € Q, at y = yo, x = 2. Rotating coordinates, we can 
assume (9; (%0)) = (0;0,u(xo)) is in diagonal form and yo = (1,0,...,0). Set 
u11 = O2u, so we take 


(15.16) w= AV? /2y 1) = o(Vu)urt. 


We now derive some identities and inequalities valid on all of 2. 
Differentiating (15.15), we obtain 


g? 0,0; 0cu = Oc f(x, u, Vu), 


(15.17) 7 a 
g” O,0ju11 = gg? (0;0;01U) (OxOmO1u) + Ort, 


where (g’”) is the inverse matrix to (g;;) = (0;0;u), as above. Also, a calculation 
gives 


(15.18) 
w | d;w = (log W)p, O;Onu + uj, (O,07u), 


wt O,0;w = w ?(0;w)(O;w) + (log W) was (O;0,U)(O; cu) 
+ (log W) pe (0;0;0;,.u) + as O,0j;U11 a Uj, (OjO7u)(O;07u). 


Forming w~1 g’0;0;w and using (15.17) to rewrite the term uj," 0;0;u11, we 
obtain 


wg a,0;w 
(15.19) >un [(log VL) prpeG (O:0n4) (0; Oeu) + (log) p, 9 O:0;Oxu 
+g g*(0,0;01u) (0,000, u) — uz g) (0,0?u)(0;02u) + 0? f. 


Now we have (log w)», = Spx and (log v) ppp, = 36", and hence 
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(15.20) (log )pxpe 9? (OiOnu)(O;Ocu) = BO" 54 ;,(0j;0eu) = BAu. 
Let us assume the following bounds hold on f (2, u, p): 

(15.21) (VA)(z,urP)l<u, [(’f)(@,u,p)| < we 


Using the first identity in (15.17), we have 


uri (log v)p,9 0,0; OnuU + orf 


15.22 
( > fp, (w d;w)urr C1 | |A?u|? + B(14 |0?ul)], 


with C = O(p, ||Vull za). 

Now, let us look at 2% 9, where, recall, eflVul?/ 202u is maximal, among all 
values of EMV 2A (a), If zo € OD Ge., xo ¢ OQ), then O;w(ao) = 0 and 
the left side of (15.19) is < 0 at xo. Furthermore, due to the diagonal nature of 
(g'7) at zo, we easily verify that g!'gGiiGj1 < g' gh CinGje, and hence 


(15.23) uy g') (O;02u)(OjO2u) < g'*g?"(A;0;01u)(O,0c01U), 
at x9. Thus the evaluation of (15.19) at xo implies the estimate 


(15.24) 0 > B(Afu)(Au) — p— C[1 + |0?u|? + B(1 + |0?ul)] 


if zo ¢ OO. Hence, with X = 67 u(z0), 
(15.25) (8 — Cy) X? < BC2(1+ X) +h, 


where C; and C2 depend on y and ||Vul|_, but not on @. Taking 6 large, we 
obtain a bound on X: 


(15.26) Fu(xo) < C(u, ||Vullr-cay) if xo ¢ AQ. 
On the other hand, if sup w is achieved on OQ, we have 


sup |O5u(x)| < sup |@?u| - exp(||Vullz~). 
x,y 


This establishes the following. 


Lemma 15.4. [fu € C?(Q) 9 C?(Q) solves (15.15) and the hypotheses above 
hold, then 


(15.27) sup |0°ul < C(p, || Vull zc) [1 +sup |07ul|. 
Q dQ. 
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To estimate 07u at a boundary point y € OQ, suppose coordinates are rotated 
so that v(y) is parallel to the x,,-axis. Pick vector fields Y;, tangent to 0Q, so that 
Y;(y) = 0;, 1 <j < n—1. Then we easily get 


(15.28) |A;Pcuy)| <¥GVe~(y)]+ ClVuty)|, L<j,k<n—1. 


In fact, for later reference, we note the following. Suppose Y; is the vector field 
tangent to OQ, equal to 0; at y, and obtained by parallel transport along geodesics 
emanating from y. If Y;, = bf Og, then 


YjYeu(y) = Oj0xu(y) + (B;b.(y)) Aeu(y) 


(15.29) 
= O;Onuly) + (VB, Yr) uly), 


where V° is the standard flat connection on R”. If V is the Levi—Civita connection 
on 0Q, we have Va,Yx = 0 at y, hence V3, Y, = —II(0;,0,) O, at y, where 


O, = —N is the outward-pointing normal and TT is the second fundamental form 
of OQ; see § 4 of Appendix C. Hence 
(15.30)  O;Ogu(y) = YjYeu(y) + 11(0;, On) uly), 1<j,k<n-1. 


Later it will be important to note that strong convexity of OQ implies positive 
definiteness of IT. 

We next need to estimate 0, Y,u(y), 1<k<n-1IfY, = bh (a) Op, then 
Up = Yxu Satisfies the equation 


(15.31) g" 0;0;0n — fp, Ove = A(x) + og" Bi; (2), 
where 


A(x) = 20; di, ar fap di, + fue + fp, (O:b§) Ogu, 


(15.32) P 
Bij (x) = (0:05;0;,) Ocu, 


and vz, | aq = Yr. This follows by multiplying the first identity in (15.17) by bf 
and summing over @; one also makes use of the identity g’? 0;0¢u = 5°. 

We first derive a boundary gradient estimate for v; = Y;u when (15.15) takes 
the simpler form 


(15.33) log det H(u) — f(a,u) =0, lng =o 


that is, Vu is not an argument of f. Here, we follow [Au]. We assume » € 
C™(Q), set 


(15.34) Wr = Y,(u— vy) = up — Yury, 
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then let a and @ be real numbers, to be fixed below, and set 
(15.35) Wp = Wet ah+t B(u— yy). 


Here, h € C®(Q) is picked to vanish on OO and satisfy a strong convexity con- 
dition: 


(15.36) (0;0;h) >I, 0. 


hl 59 = 


The hypothesis that is strongly convex is equivalent to the existence of such a 
function. 
Now, a calculation using (15.31) (and noting that in this case f,, = 0) gives 


(15.37) g'! 0,0;, = A(x) + nB +g Biz(x), Dilog = 0, 
where A(x) is as in (15.32) (with the last term equal to zero), and 
(15.38) Bi; (x) = Bi;(x) — 0:0;Yep + a O,Ojh — 8 O,0;9. 


We now choose a and (3. Pick 3 = Go, so large that A(x) + n6o > 0. This 


done, pick a = ag, so large that (B;;) > 0. Then wzo, defined by (15.34) with 
Q=apo, 2 = (po, Satisfies 


(15.39) 9 0;0;00 > 0,  Broloq = 0- 


Similarly, pick 6 = (3 sufficiently negative that A(z) + n{, < 0, and then pick 


a = aq, sufficiently negative that (B;;) <0. Then, w&,1, defined by (15.35) with 
a = a, and 3 = f, satisfies 


(15.40) g'0;0;0r1 <0, Dralao =0. 
The maximum principle implies wo < 0 and wy; > 0; hence 
(15.41) Yep — ah — Bi(u— yp) < Ypu < Yyy — anh — Bo(u— vy). 
Thus, if 0, denotes the normal derivative at 00, 
(15.42) = |O,Ypul < (ao — a1) |OLA| + (Go — G1) |OLu — OLy| + |OYe yl, 
when w solves (15.33). 

In view of the example (15.5), for a surface with Gauss curvature I, we have 
ample motivation to estimate the normal derivative of Y;,u when u solves the more 


general equation (15.15). We now tackle this, following [CNS]. 
Generally, if w, = Y,(u — y), (15.31) yields 
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g! O,0;Wk — fp, OcWe 


(15.43) Ag 
= [A(z) + fp AY] + g” [Bis (x) — 0,0; Ye] = P(x). 


Note that, given a bound for wu in C1(Q), we have 
(15.44) |®(x)| < C+ Cg”, 


where g’/ is the trace of (g’’). 

Translate coordinates so that y = 0. Recall that we assume v(y) is parallel 
to the x,,-axis. Assume xz, > 0 on Q. As above, assume h € C%®(Q)) satisfies 
(15.36). Take pp € (0,1/4) and M € (0, 00), and set h,,(x) = h(x) — plz|?. We 
have 

@ 0:0; — fp; Oi) (Ry + Mz) 
(15.45) = 9 O,0;hyu — fp; Oihy + 2Mg"” — 2M fy, 2n 


> (50 +2Mg"") —(M fp, n+ fp; Oihy). 


The arithmetic-geometric mean inequality implies 


(29) + Mon), 


j<n 


(Mop-te,) 


and if the eigenvalues of (gi? ) are On < +++ < 01, we have g”” > op, and hence 


(15.46) [M1 det(gi4)]/” < : (g/7 + Mg”). 


n 
Given a positive lower bound on det(g’?) = 1/F(x,u, Vu), we have 
(15.47) ; g?) +2Mg”" > cg +e.MV", 
Hence (15.45) implies 
(15.48) (97 0:0; — fy, 0:)(Ry + Ma?) > cg?) + ey M/" — cp — 3M an. 
At this point, fix M sufficiently large that c; M 1/” > 1 4+ cg, so that 
(15.49) (g" 0:0; — fp, 0:)(Ry + Mz?) >1+cg! —ceg3Mz, on 2. 


Now, let 
O, ={xEN:0< an < eh}, 


as illustrated in Fig. 15.1. We can then pick ¢ sufficiently small that (e.g., with 
= 1/8) 
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. . I 
(15.50) (97 0:0; — fp, 0:)(hp + Maz) > cg?) + zon Oe. 


Note that the function h has the property Vik 4 0 on OQ. Thus, after possibly 
further shrinking ¢, we have 


hy t Mz? < 0 on 00.N ON, 


15.51 
( ) —ce4 <0 on ON {ay = €}. 


With « > O so fixed, we can then pick A sufficiently large (depending on 
Ilell on @y) that c,A > Yt] to (@)3 hence 


we+ A(R + Mz?) 


15.52 
; wr — A(hy + Mz?) 


on 0O,. We can also pick A so large that (by (15.50) and (15.43)-(15.44)) 


(9'0;0; = Fac0h) (wr + A(hy =e Mz?)) 


(15.53) . 
(9° 8;9; — fp,8;) (we — A(hy + Mz°)) 


= 0, 
<0 


on O-. The maximum principle then implies that (15.52) holds on O-. Thus 
(15.54) |AnYeuly)| < AlOnhy(y)]- 
This completes our estimation of 0, Y,u(y), begun at (15.31). 

We prepare to tackle the estimation of 0?u(y). A key ingredient will be a 
positive lower bound on OFuly), for 1 < 7 <n—1. In order to get this, we make 
a further (temporary) hypothesis, namely that there is a strictly convex function 
ut € C®(Q) satisfying 
(15.55) log det H(u*) — f(x,u*,Vu%) <0onQ, u*|,, =. 


The function u* is called an upper solution to (15.1). The proof of (15.8) yields 


FIGURE 15.1 Setup for Normal Derivative Estimate 
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(15.56) ue < tg <p <u*® on O, 


fora <7 € J. In the present context, where we have dropped the o and where 
Ue cr(Q) is a solution to (15.15), this means u? < u < u* on 2. Conse- 
quently, complementing (15.11), we have 


(15.57) 0,u>O0,u%* on ON. 


Now let Y; be the vector field tangent to 0Q, equal to 0; at y, used in (15.30). 
We have 


(15.58) OPu(y) = ¥Puly) + Kjuly), Kj = IT(A;,0;) > 0 


for 1 < 7 <n-—1, by (15.30), assuming 02. is strongly convex. There is a similar 
identity for OFut (y). Since u = u* = —y on OQ, subtraction yields 


(15.59) OFuly) = OFu* (y) + ry [Ovu(y) — Qu*(y)] > Afu*(y), 


J 


for 1 < 7 <n—1, the inequality following from (15.57). Since u* is assumed to 
be a given strongly convex function, this yields a positive lower bound: 


(15.60) uly) >Ko>0, 1<j<n-1. 


Now we can get an upper bound on 6?2u(y). Rotating the 71 ...@_1 coordi- 


nate axes, we can assume (O;Oxu(y)) peas is diagonal. Then, at y, 


(15.61) det H(u oT OFu) + x(0?u), 


where x is an n-linear form in 07u(y) that does not contain 0?u(y). Since det 
H(u) = f(x,u, Vu) and we have estimates on Vu, as well as 0;0,u(y) for 
O;On A 02, we deduce that 


(15.62) Ko 02 u(y) < Ki. 


This completes the estimation of ||u||¢2(@)- 

Once we have a bound in C?(Q) for solutions to (15.15), we can apply The- 
orem 14.6 to deduce the existence of a solution u € C™(Q)) to (15.1). We thus 
have the following: 


Proposition 15.5. Let Q C R” be a smoothly bounded, open set with strongly 
convex boundary. Consider the Dirichlet problem (15.1), with p € C(0Q). 
Assume F(x, u, p) is a smooth function of its arguments satisfying 


294 14. Nonlinear Elliptic Equations 


Furthermore, assume (15.1) has a lower solution u®, and an upper solution ut € 
C™(Q). Then (15.1) has a unique convex solution u € C°(Q). 


After a little more work, we will show that we need not assume the existence 
of an upper solution u*. Note that u* was not needed for the estimates of 


89 = sup Jul, $1 = sup |Vu 
in Lemmas 15.1—15.3. Thus, if we take a constant a satisfying 
0<a< inf {F(z,u,p): 2 €Q, |u| < 80, |p| < si}, 
then any smooth, strongly convex u* satisfying 
(15.63) det Hu") <a on 0, at |2, = y, 


will serve as an upper solution to (15.1). Thus, for arbitrary a > 0, we want to 
produce u* € C%®°(Q), which is strongly convex and satisfies (15.63). For this 
purpose, it is more than sufficient to have the following result, which is of interest 
in its own right. 


Proposition 15.6. Let 2 C R" be a smoothly bounded, open set with strongly 
convex boundary. Let p € C™ (OQ) be given and assume F' € C™°(Q) is positive. 
Then there is a unique convex solution u € C™°(Q) to 


(15.64) det H(u) = F(x), ula = 9. 


Proof. First, note that (15.64) always has a lower solution. In fact, if you extend 
y to an element of C%°(Q) and let h € C%°(Q) be as in (15.36), then u? = y+ Th 
will work, for sufficiently large r. 

Following the proof of Proposition 15.5, we see that to establish Proposition 
15.6, it suffices to obtain an a priori estimate in C?(Q) for a solution to (15.64). 
All the arguments used above to establish Proposition 15.5 apply in this case, up 
to the use of u#, in (15.55)-(15.59), to establish the estimate (15.60), namely, 


(15.65) uly) >Ko>0, 1<j<n-l. 


Recall that y is an arbitrarily selected point in OQ, and we have rotated coordinates 
so that the normal v(y) to OQ is parallel to the x,,-axis. If we establish (15.65) in 
this case, without using the hypothesis that an upper solution exists, then the rest 
of the previous argument giving an estimate in C?(Q) will work, and Proposition 
15.6 will be proved. 

We establish (15.65), following [CNS], via a certain barrier function. It suffices 
to treat the case 7 = 1. We can also assume that y is the origin in R” and that, 
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near y, OO is given by 
n—-1 

(15.66) o = pa) => Bap ole?) Bou, 
j=l 


where a’ = (@1,...,2n—1). 

Note that adding a linear term to u leaves the left side of (15.64) unchanged 
and also has no effect on OFu. Thus, without loss of generality, we can assume 
that 


(15.67) u(0) =0, O;u(0)=0, 1<j<n-1. 


We have, on 02, 


1 
(15.68) u=—=5 PD Vine jer + 33(x") + O(\x|*), 
jk<n 


where %3(2’) is a polynomial, homogeneous of degree 3 in x’. 
Now consider 


(15.69) a(x) = ul(z)—Atp, A= By ly. 


This function satisfies det H(u) = F(a). Looking at ii 5 = y — Xp(2’), we see 
that the coefficients of x? cancel out here. We claim there is an estimate of the 
form 


(15.70) tla < >. arjaiey +C( S- xp + |el*). 


1l<j<n l<k<n 


Indeed, in light of our remark about the disappearance of x7, we need only worry 
about a multiple of x}, which can be dominated on OQ by a term of the form 
G1nX12£», plus a multiple of the quantity in parentheses in (15.70). 
The barrier function will take the form 
1 2 2 
(15.71) W(t) = 55 S> (aijai + Ba;)? + |x)? — ean. 


1<j<n 


Take B >> C, then fix 6 > 0 small, and take ¢ << 6. We can do this in such a 
fashion as to arrange 


(15.72) W>@ on AQ. 


Note that 26 is the smallest eigenvalue of H(W), and all the other eigenvalues are 
bounded above independently of 6 € (0,1), so choosing 6 small enough gives 
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(15.73) det H(W) < F(x) on Q. 


Then W is an upper barrier for w; the maximum principle yields 


(15.74) u<W on. 
Consequently, 
(15.75) On ti(0) < OnW (0) = —e. 


As noted above, our construction (15.69) yields 

(15.76) ule ple )) =0, a2? =; 

that is, 07% + (0,%)0?:p = 0, at x’ = 0. Hence 

(15.77) O?u(0) = 07%(0) = —On%i(0) - 02 0(0) > €0?p(0). 

This proves the 7 = 1 case of (15.65), as needed, so Proposition 15.6 is proved. 


In light of the comments made after the statement of Proposition 15.5, we have 


Corollary 15.7. In Proposition 15.5, the hypothesis that there exists an upper 
solution u* can be omitted. 


There are some results for Monge-Ampere equations on nonconvex domains; 
see [GS] and [HRS]. 

In addition to the Monge—Ampere equations studied here, there are complex 
Monge—Ampere equations, whose study has been very important in complex 
function theory and differential geometry; see [Au, BT, CKNS, Fef, Yau1]. The 
paper [Yaul] established the existence of an important class of compact Kahler 
manifolds known as Calabi-Yau manifolds, which have vanishing Ricci tensor. 
These play fundamental roles in areas ranging from algebraic geometry to string 
theory. See [Bes, Yau4, B11, B12], and references therein, for further exposition. 


Exercises 


1. Let Q C R? be a strongly convex, smoothly bounded region. Let us assume that F’ € 
C™(Q), p € C* (AQ), and F > 0. Show that 


det H(u) = F(z) on 2, ul, =¥, 


has exactly two solutions in C°°(Q), one convex and one concave. 

2. Suppose the hypothesis 0, F(x, u,p) > 0 in Proposition 15.5 is dropped. Establish the 
existence of solutions, using the Leray—Schauder theory. 

3. Given 2 as in Proposition 15.5, ¢ € C'°°(0Q), show that there exists Ko > 0 such 
that, for all K € (0, Ko), there is a unique convex solution ux € C™(Q) to 
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(n+2) /2 


(15.78) det H(ux) = K(1+|Vux|*) on 2, UKs, =. 


(Hint: Show that the convex solution to (15.64), with f = 1, yields a lower solution for 
(15.78), provided K > 0 is sufficiently small.) 
Note that the graph of wx is a surface with Gauss curvature kK. 

4. With wic as in Exercise 3, show that there is uo € Lip'(Q) such that 


(15.79) uk Zuo as K\,0. 
In what sense can you say that wo solves 
(15.80) det H(uo) =0 on Q, woly, = 9? 


See [RT] and [TU] for more on (15.80). 


16. Elliptic equations in two variables 


We have seen in § 12 that results on quasi-linear, uniformly elliptic equations for 
real-valued functions on a domain (2 are obtained more easily when dim 2. = 2 
than when dim 2 > 3 and have extensions to systems that do not work in higher 
dimensions. Here we will obtain results on completely nonlinear equations for 
functions of two variables which are more general than those established in § 14 
for functions of n variables. The key is the following result of Morrey on linear 
equations with bounded measurable coefficients, whose conclusion is stronger 
than that of Theorem 13.7: 


Theorem 16.1. Assume u € C?(Q) and Lu = f onQ C R?, where 
2 . 
(16.1) Lu= S> a*(a) 0jOqu. 
j,k=1 


Assume ai® = a3 are measurable on Q and 
(16.2) AE)? < a?* (w)EjEx < AIEl?, 


for some X, A € (0,00). Pick p > 2. Then, for O CC Q, there is a pp > 0 such 
that 


(16.3) Ilullcr+n(o) < C[lullzcay + If llze@y], 


where C = C(O,Q,p, A, A). 


Proof. Let V; = 0;u. Then these functions satisfy the divergence-form equations 
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es a, (2 av; +255 502V;) + 02(02V1) = ai(-5). 


1 (A1V2) 4 oo Mow ! 24 ar) =a,(4). 


Qa 


Proposition 9.8 applies to each of these equations, yielding 
(16.5) Villon) < C[llVillz2@y + I fllze(a)]. 


This yields the desired estimate (16.3). 


Morrey’s original proof of Theorem 16.1 came earlier than the DeGiorgi- 
Nash-Moser estimate used in the proof above. Instead, he used estimates on 
quasi-conformal mappings (see [Mor2]). 

We apply Theorem 16.1 to estimates for real-valued solutions to equations of 
the form 


(16.6) F(a,u, Vu,0?u) = f onQ CR’, 


where F = F(x,u,p,¢) is a smooth function of its arguments satisfying the 
ellipticity condition 


OF 
NEP sD) ge (@ wep. OEE S AED, 
a] 
0<A=A(u,p,¢), A = A(u,p, ¢). 


(16.7) 


For h > 0, = 1,2, set 
(16.8) Ven (a) = h7" (u(x + hee) — u(z)). 
Then Vz), satisfies the equation 


(16.9) dale )O;O4.Ven = gen(x) 


on Q, = {x € 1: dist(z,R? \Q) > h}, where the coefficients alk (a) are 
given by 


jk | OF 


(16.10) ap, (Z) = (« + shep,...,80°7,u + (1 — s)0°u) ds 
0 OCjk 


with T¢,u(a) = u(x + heg), and the functions ge;, (x) are given by 
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(16.11) 


1 
gen(x) = — > ff i (a + sheg,...,s07T¢,u + (1 — s)0°u) ds| 0; Vex 
j 0 j 


| OF Z : 
-f Dy (t+ shee +++» 80 Ten + (1 —8)O u) ds Von 
0 


' OF 
_ | =—(x + sheg,...,80°t,u+ (1 — s)07u) ds 
0 Ox, 
+ mee: + hee) — Fe): 
Theorem 16.1 then yields an estimate 
(16.12) I|Venllcrtu(oy < Cl Venllz2@a) + Ilgenllzeay], 


with C = C(O,Q,p, 2, A, ||ul] o2(q))- Note that 


(16.13) IIgen|lze@ay S C(llulloz@y) + |2-" (ren f — Alley: 
Letting h — 0, we have the following: 


Theorem 16.2. Assume that Q C R?, that u € C?(Q) solves (16.6), that the 
ellipticity condition (16.7) holds, and that f € H*?(Q), for some p > 2. Then, 
given O CC Q, there is a 1 > 0 such that u € C?*#(O) and 


(16.14) llullc2tuo) S$ C1 + |Ifllnx@], 
where 
(16.15) C = C(O,9,p,d,A, llullor@): 


For estimates up to the boundary, we use the following complement to 
Theorem 16.1: 


Proposition 16.3. Jf u € C?(Q) and the hypotheses of Theorem 16.1 hold, then 
there is an estimate 


(16.16) lll crsn@y S C[llullz2.»(ay + Ilellc2(aa) + IIfllzea], 


where and C = C(Q, p, X, A). 


= uloo 


Proof. Given y € OQ, locally flatten OO near y, using a coordinate change, trans- 
forming it to the x-axis. In the new coordinates, wu satisfies an elliptic equation 
of the form 


(16.17) @*O;Opu = f —Oju= f. 
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Then Vi = O,u satisfies an analogue of the first equation in (16.4), while Vi = 
O1¢p on the flattened part of OQ. Thus Proposition 9.9 (or rather the local version 
mentioned at the end of § 9) yields an estimate on V, in C4(U 1 Q), for some 
neighborhood U of y in R?. 

Thus, for any smooth vector field X on R?, tangent to 02, we have an estimate 
on ||Xul] ou @) by the right side of (16.16). Furthermore, by Proposition 9.9, there 
is a Morrey space estimate 


(16.18) IV Xull g(a) < RHS, 


for some g > 2, where “RHS” stands for the right side of (16.16). We may as well 
assume q < p, so f € L?(Q) C M$(Q). Then (16.17) and (16.18) together imply 


(16.19) |2;Avull area) < RHS, 


for all 7, & < 2, which in turn implies (16.16). 
We now establish the following: 
Theorem 16.4. Assume that Q C R? and that u € C?(Q) solves (16.6), with the 


ellipticity condition (16.7), with f € H'?(Q) for some p > 2, and til ag =. 
Then, for some t > 0, there is an estimate 


(16.20) Ilullc2+u@y < C1 + Ilellos(aq) + [If llan.2(@], 
where 
(16.21) C = C(,p,A,A, |lullcag)- 


Proof. If X = b‘d; is a smooth vector field in R?, tangent to 0Q, then Xu 
satisfies 


Fe, 0;0%(Xu) = — Fy, 0;(Xu) — Fy Xut Fe,,(OjOnb*) (Ocu) 


(16.22) ; ‘ 


and Xu = Xv on QQ. Thus Proposition 16.3 applies. We have a C!t#(Q)- 
estimate on X u, and even better, a Morrey space estimate: 


(16.23) |0;O.X ull me(ay < RHS, 
for some q > 2, and for all j,k < 2, where “RHS” now stands for the right side 


of (16.20). 
The proof is almost done. Parallel to (16.22), we have, for any @, 


(16.24) Fo, Oj O,0eu — —F,, O;Ocu — Fy, Ogu + Of. 
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Thus we can solve for 0;0,0¢u in terms of functions of the form 0;0,Xu and 


other terms estimable in the M4(Q)-norm by the right side of (16.20). Hence we 
have (16.20), and even the stronger estimate 


(16.25) ||°ull g(a) < RHS. 
From this result the continuity method readily gives the following: 
Theorem 16.5. Let Q be a smoothly bounded domain in R?. Let the function 


F(a, u, p,¢) depend smoothly on all its arguments, for 0 € [0,1], and let yp, € 
C™(Q) have smooth dependence on a. Assume that, for each a € [0, 1], 


OuF s(x, U, D, ¢) < 0 


and that the ellipticity condition (16.7) holds. Also assume that, for any solution 
Ug € C™(Q) to the equation 


(16.26) F(x, Ue, Vio, 07g) =O0onQ, lies as = Yo, 
there is a C?(Q)-bound: 
(16.27) lUclloz@y SK. 


If (16.26) has a solution in C®(Q) for o = 0, then it has a solution in C°(Q) 
foro=1. 


Exercises 


1. In the proof of Theorem 16.1, can you replace the use of Proposition 9.8 by a result 
analogous to Proposition 12.5? 

2. Suppose that, in (16.7), A and A are independent of ¢. Obtain a variant of Theorem 16.5 
in which (16.27) is weakened to a bound in C1(Q). 


17. Overdetermined elliptic systems 

Here we look at nonlinear systems 

(17.1) F(z,D™u) =g on 2Q 

that are overdetermined, in that u : Q > R*, g:Q> IR‘, and £ > k. We assume 


F is smooth in its arguments. If u € C™(Q), the linearization of (17.1) at u is 
given by 
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d m 
(17.2) Lu= Gq P (u+tv))|,_o 


a linear k x £ system. We say (17.1) is overdetermined elliptic provided the symbol 
o1(2, €) is injective for each x € 2, € 4 0. Our first goal is to prove the following 
analogue of Theorem 4.6. 


Theorem 17.1. [fr > 0, u € C™*"(Q) satisfies (17.1), and if this system is 
overdetermined elliptic, then, given0 <r < 5, 


(17.3) geCi’—uec™s, 
Proof. Parallel to Proposition 4.7, we have 
(17.4) F(a, D™u) = M(u;2, D)u+ R, 


with R € C®@™ and, as in (4.48)-(4.50), we pick 6 € (0,1) and apply symbol 
smoothing to write 


(17.5) M(u; x, D) = M*(a, D) + M*(z, D), 

where, ifue C™t", r > 0, 

(17.6) M¥(z,€) € Si%, M(z,€) € ST”. 

As in (4.52)-(4.54), the hypothesis that (17.1) is overdetermined elliptic implies 
(17.7) M* (a, D) is overdetermined elliptic. 

Now (17.1) gives 

(17.8) M* (a, D)u=g—R— M(x, D)u=h, 

and Theorem 9.1 of Chapter 13 gives (for r > 0) 


we Cont —» M*(z, D)u € Ott? 


(17.9) 
=+fAeE Corr. 


where a A b = min(a, b). 
Now we exploit (17.7). We have 


(17.10) M* (x, D)*M*(x, D) € OPS?7%, elliptic, 


hence this operator has a parametrix G € OPS) a We set 
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(17.11) E = GM* (x, D)* € OPS’, 
giving 

(17.12) EM#(xz,D)=I+Ro, Ro € OPS-@~. 
Now 

(17.13) E:C? —cCct™, VoeR, 


so we apply F to (17.8), obtaining (mod C'°°) 
(17.14) u=EheECet?, o=sA(r+ro). 
Having this improvement over the hypothesis u € C™*", we iterate this argu- 


ment, obtaining the desired result (17.3). 


Example 1. We produce an overdetermined system satisfied by an isometry 
between two Riemannian manifolds. Let 2,O C R” be open sets, and suppose 
they carry metric tensors G = (g;.), H = (hjx), respectively. Let 


(17.15) u:Q—+O 
be a C! diffeomorphism, and assume u*H = G, ie., 
(17.16) (Du(x)a, Du(x)8) an = (a, B)e 


for all a, 8 € R”. Here (a, 3), = a- G(x), etc. The equation (17.15) has an 
equivalent form 


(17.17) Du(x)'H(u(x)) Du(x) = G(z). 


This is a first-order system for u. Taking symmetry into account, we have n(n + 
1) /2 equations in n unknowns. The linearization of the left side about u is given 
by 


(17.18) Lv = Du(x)'H(u(x)) Do(x) + Dv(x)'H(u(x)) Du(z) 
+ Du(x)'DH(u(x))v(x) Du(x), 
with principal symbol 


(17.19) ot(a,€): R” — M(n,R) 


satisfying 
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oL(a, = 2x symmetric part of 
i7oe) L(x, €)p ‘ P 
Du(a)'H(u(2)) op(, €)o- 
Since op(x, é)W = ie", the right side of (17.20) is 7 times 
(17.21) Du(x)'H(u(x)) we". 
The ellipticity condition is that a, (x, €)w is nonzero unless € = 0 or w = 0, or 
equivalently that, for each x € 2, the symmetric part of (17.21) is nonzero unless 


€=Oory=0. 
To check this, note that, for a € R”, 


(17.22) a+ Du(x)'H(u(2)) va = (a-)(a-y), y= Du(2)’A(u(z))d, 
which vanishes for all a € R” if and only if either € = 0 or y = 0,7 and since H is 
positive definite and Du(x)* is invertible, the latter happens if and only if ~ = 0. 
We hence have: 

Proposition 17.2. [fu : Q > O is aC! diffeomorphism satisfying (17.16), with 
H € C',G € C® metric tensors, then the system (17.17) is overdetermined 
elliptic. Consequently, ifr > 0, s > r, and 

(17.23) ued", Hec™, GeEec?, 

then Theorem 17.1 applies, to give 


(17.24) ue Clr, 


A weak point of Proposition 17.2 is the hypothesis in (17.23) that H € C°. 
We aim to establish the following improvement. 


Proposition 17.3. In the setting of Proposition 17.2, we can replace (17.23) by 
(17.25) ueC", Hec*®, Gec’, 
and still conclude that wu satisfies (17.24). 
Our approach to the proof uses the following. 
Proposition 17.4. Consider the overdetermined PDE 
(17.26) ©),(z, Du) = Du(z)'h(x) Du(x) = G(z), 


where u: Q + O is aC! map, G,h : Q — M(n,R), symmetric and positive 
definite. Then, givenO <r < sand 
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(17.27) eel, hed, -Gec%, 
we have 
(17.28) we ci, 


Proof. It follows from Proposition 10.12 of Chapter 13 and subsequent discus- 
sion that 


(17.29) ®, (a, Du) = M(u;z,D)ut+ R, REC, 


and, with 6 = 1-—r/s, y € (6,1), 


(17.30) M(u;2,D) = M#(a, D) + M°(a, D), 
with 
(17.31) M*#(z,D)€ OPS},, M(x, D) ¢ OPO?S,,~”, 


Hence (17.26) implies 
(17.32) M* (x, D)u = (G — R) — M®(a, D)u. 


Computations leading to Proposition 17.2 give that (17.26) is overdetermined 
elliptic, hence M* (x, D) is overdetermined elliptic. Therefore we have 


(17.33) E€ OPS;, 
such that 
(17.34) EM#(xz,D)=I+ Ro, Ro € OPS-@~. 


Applying F to (17.32) gives 
(17.35) u = E(G— R)— EM°*(z, D)u, mod C®. 
Note that E(G — R) € C}*%. Hence, by (17.31) and (10.109) of Chapter 13, 


uecir > M(x, D)u€C™, 1, = min(r + (7 — 4)s, 8) 
(17.36) => EM*(2,D)ue ch 
=>ne Cr, 


an improvement over the initial hypothesis on wu. An iteration gives the desired 
conclusion to Proposition 17.4. 
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Proof of Proposition 17.3. Given the hypotheses of Proposition 17.3, including 
O0<r<-s,set 


(17.37) h(x) = H(u(2)). 
Then 
(17.38) wec’ —heEC’, o=min(s,1+1), 


and Proposition 17.4 gives 
(17.39) vec. 


This is an improvement over the initial hypothesis on u. An iteration gives the 
desired conclusion, u € C}t®. 


Proposition 17.3 is still not quite sharp. As shown in [T4], one can sharpen its 
conclusions to 


(17.40) uecCl, H,Gec®’—=uec'™, 
valid for all s > 0, including integers. The proof follows completely different 
lines, involving existence and regularity of local harmonic coordinate systems. 


Treatment of the case s € N uses a trick from [CaH]. 


Example 2. The following overdetermined system for a real-valued function u 
arises in the study of the Wey] tensor in (C.40): 


1 1 SOjk . 
(17.41) Usk = Uitte — 5 G5nldule + (8 — Rice). 
Here Ric is the Ricci tensor and S' the scalar curvature. It is shown in Proposi- 
tion C.4 that if the Weyl tensor Wisk vanishes and if one can find a solution 


u € H'? (with p > n) to (17.41) (assuming also gj; € H'”), then 9, = e?"gjx 
has vanishing Riemann tensor Ry jk- Existence of solutions to (17.41) is classical 
if gj~ € C®. Then one obtains u € C™ provided g;, € C™ andm € N, m > 3. 
Our goal here is to establish further regularity results about u, given other regu- 
larity hypotheses on g;,. 

One way to derive a determined elliptic PDE from (17.41) is to multiply by 
g* and contract, using the identity 


(17.42) gr u.j.~ = Au, 


where A is the Laplace-Beltrami operator. Then we get 
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n—2 1 
17.4 Au = du|? . 
cise) a g Ga” 


While (17.43) is a neat looking formula, there is a more elementary way to estab- 
lish regularity in this case. Since 


(17.44) Unjik = OpOju — 15, Oeu, 


we have from (17.41) that 


(17.45) 0;0,u =T;,Ocu + right side of (17.41). 
Now, if r > 0, 

(17.46) giz € C1*” => T* 5, € CT, Ricje, SEC", 
SO 


ue C! => 0j;0,u€ C°+ C71" 
(17.47) : ; 
—=ueCce, og =min(1,r). 
If o <r, we can iterate the argument, obtaining 
(17.48) 0;0,u EC? +C,'t", hence ue C24? 4 C7t, 


and ultimately obtain the following: 


Proposition 17.5. Assume gj, € C't", r > 0, and that u € C’ solves the 
overdetermined system (17.41). Then 


(17.49) we ch, 


REMARK. If in Proposition 17.5 one has r € N, then one can change the last part 
of (17.46) to 


(17.50) Rica, Seo, 

and, having from (17.49) that u € C?~1, obtain directly from (17.45) that 
(17.51) uec tt, 

refining (17.49). 

Example 3. The Weyl tensor as an overdetermined system. 


When the Riemann curvature tensor Ri; x on an n-dimensional Eiemannian man- 
ifold is decomposed into pieces invariant under the SO(n) action, one piece is the 
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Wey] tensor, 


(17.52) 
1 
W iin = R ijk + ae [oj Riciz, ai, Ric;; +Gik Ric! ; — Jij Ric’ x| 
S 
| O49; — 0° 5 9ik). 
(na Ay(m aay HI F 4 9H4) 


This tensor has the remarkable property that if we conformally scale the metric 
tensor, obtaining gj, = e7"g;,, its associated Weyl tensor is unchanged: 


(17.53) W isn = Wa 


Details can be found in Appendix C to this chapter. As shown there, for a metric 
tensor with limited regularity, 


(17.54) gjk € H)”, p> n= Win € HO)”, 


and (17.53) holds if also u € H'?. We have the result of Weyl that the vanishing 
of W* isk is necessary for the metric tensor to be locally conformally flat. The 
converse, due to Schouten, in its classical version, states that, if gj, € C®, then 
this vanishing is sufficient for the metric to be conformally flat, provided n > 4. 
(The classical approach to this involves the overdetermined system considered in 
Example 2.) See Proposition C.2. This result has the following improvement, due 
to [LiS2]: 


Proposition 17.6. Assume n > 4 and gj, € C" for some r > 1. If W*i;x = 0, 
then the metric tensor is locally conformally flat. 

By way of introduction to the method of solution, we mention the following 
somewhat parallel but simpler result, established in [T4]. 
Proposition 17.7. Assume a Riemannian manifold M has metric tensor 
(17.55) grEC, r>1, 


and Ro isk = 0. Then there exists a local isometry of a neighborhood of any 
point p onto an open set in flat Euclidean space, and yp € C"**. Furthermore, if 


(17.56) gin €C7N H"?, re (0,1), 
and Ro iik = 0, there is such a local isometry, and ip € Cre. 


Here, the first step taken in [T4] is to take local harmonic coordinates 
(u1,..., Un). These exist as long as r > 0, and we have 
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(17.57) uj € C'T" 1 H??, 


as long as (17.56) holds, as shown in [T3], Chapter 3, Proposition 9.4. Now, as 
mentioned in (4.94) of this chapter, in local harmonic coordinates we have 


1 a z a ee . 
(17.58) 5 35 OG"*(w)Akdem + Qem(G, VI) = Ricem, 


where gem are the components of the metric tensor in local harmonic coordinates, 
a priori in CO" 0. H*, but thanks to the determined elliptic PDE (17.58) actually in 
C@ if Ric = 0. See Proposition 12B.2, which strengthens Proposition 4.10. From 
here, the standard argument, given in Proposition 3.1 of Appendix C, Connections 
and Curvature, yields a local isometry onto an open set in flat Euclidean space, 
C®™ in the harmonic coordinates, hence with the stated regularity in the origi- 
nal coordinates, when (17.56) holds. Further arguments of [T4] yield the stated 
regularity when (17.55) holds. 

The approach to Proposition 17.6 taken in [LiS2] differs from the approach to 
Proposition 17.7 described above in three crucial respects: 
(i) One needs an equation for a conformally rescaled metric tensor g;,. 
(ii) One uses, not local harmonic coordinates, but rather n-harmonic coordinates. 
(iii) Then one shows that the Weyl tensor has the form of an overdetermined 
elliptic system for the rescaled metric tensor g;,. 

Here we sketch the steps to prove Proposition 17.6, referring to [LiS]-[LiS2] 
for details. 

The need for (i) arises from the invariance (17.53). We define the conformally 
rescaled metric tensor 


(17.59) Gjx =|9\-/" 95x, |g] = det (gin). 


To define the class of n-harmonic coordinates, we start with the definition that, 
if 1 < p < oo, a function u on an open set 2. C M is p-harmonic provided 


(17.60) d* (\du|?~*du) = 0 
on 2. For p = 2 we have the usual class of harmonic functions. 


In Theorem 2.1 of [LiS] one has: 

If gin € C™, r > 1, in some coordinate chart about x9 € M, then there exists 
a local coordinate chart about xq whose coordinate functions are p-harmonic and 
have C’*t! regularity. Moreover, all p-harmonic coordinates near 29 have C’*! 
regularity. 


An accompanying result is Proposition 2.5 of [LiS]: 
If 93% € C”, r > 0 in some local coordinate chart ~ about x, and if T’ is a tensor 
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field of class C” in y coordinates, then T is of class CT in every p-harmonic 
coordinate system. 


Proposition 2.6 of [LiS] singles out n-harmonic coordinates: 
Any coordinate chart on M on which g;, is a scalar multiple of 0; is n-harmonic. 


A related result of substantial use in [LiS]—[LiS2] is: 
If gjx,v € C", r > 1, and if wu is n-harmonic for the metric g;,, then u is also 
n-harmonic for the metric 9, = €7" gjr. 


These results from [LiS] are key ingredients in the following regularity result, 
Theorem 1.2 of [LiS2]: 

Let M be a Riemannian manifold of dimension n > 4. Assume gjx € C", r > 1, 
in some local coordinates. Assume 


(17.61) Win € C, for some s > r — 2 in n-harmonic coordinates. 
Then 
(17.62) jx = |g /" gin € C2*? in these coordinates. 


Derivation of Proposition 17.6. In the setting of this proposition, we can take 
local n-harmonic coordinates, in which the metric tensor gj; is CY, and the Weyl 
tensor is still = 0. The regularity result stated above gives g;, € C°, and its 
Weyl tensor is = 0. Thus the classical result of Schouten gives that g;; is locally 
conformally flat. 


The task that remains is to describe how Theorem 1.2 of [LiS2] is established. 
So let gj, € Cf, r > 1, in some coordinate system. Then there exists an n- 
harmonic coordinate system, in which gj, € Cy. Ifn > 4 and (17.61) holds, 
produce §;x = | gh te "jr, which has the same Wey] tensor W* sik therefore still 
satisfying (17.61) (since g;;, has the same family of n-harmonic coordinates). 

Now the key is to analyze W°; jk as 
(17.63) Wijk = Fey (2, Do), 
taking the defining formula (17.52) and plugging in the formulas (C.9)-(C.10) 
(with g replaced by g) to give Ric and S. We then reverse (17.63), 


(17.64) F*je(2, D729) = Wijk, 
and analyze this as a second-order nonlinear system of PDE for (g;,). The key 


technical result of [LiS2] pertaining to the structure of (17.64), established in 
Lemmas 2.3 and 3.2 of that paper, is the following. 
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Lemma 17.8. Let 9;, € Cl, r > 1, in n-harmonic coordinates, and form gjr. 

Then the principal symbol of the linearization of (17.64) at g is injective. Hence 
(17.64) is an overdetermined elliptic system. 

Having stated this, we must note that Theorem 17.1 does not apply in this 


setting; the smoothness hypothesis on g is not great enough. Here we need to 
make use of special structure of (17.64), namely 


(17.65) F(2,D°g) = S > A(x, 9)0 B(a, D§). 


ja|=2 
Such a special class has a well defined linearization for g € C’, r > 1. The 
proof of Lemma 17.8 presented in [LiS2] is elaborate, involving consideration of 
another tensor from conformal geometry, the Bach tensor, germane when g has 
greater regularity, followed by further calculations to relax the regularity hypoth- 
esis. We refer to [LiS2] for details. 
We move on to a general result, parallel to Theorem 17.1, dealing with such 


a class of systems as appears in (17.65). To phrase the result in a more general 
setting, we look at nonlinear systems 


(17.66) do Aa(2,u)O%u + Bla, Du) = g on 0 


ja|=2 


that are overdetermined, in that uw: Q > R*, g: 25 R°, and @ > k. We assume 
Aq and B are smooth in their arguments. Here is an analogue of Theorem 17.1. 


Theorem 17.9. [fr > 1, u € C"(Q) satisfies (17.66), and if this system is 
overdetermined elliptic, then, given s > r — 2, 


(17.67) geC>uec™. 

Proof. Parallel to (17.5), we pick 6 € (0, 1) and write 

(17.68) Aq (a, u) = A# (2, D)u+ A(x, D)ut Ra, 
with Ra € CC®, where, ifu EC", r>1, 

(17.69) A#(x,D)€ OPS?5, A‘(x,D) € OPS;7°. 
Pick 6 so close to 1 that ré > 1. Then 

(17.70) Aig ae, 


since r(1+6) > 2. The hypothesis that (17.66) is overdetermined elliptic implies 
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(17.71) M#(a,D) = S~ A#(2x, D)d® is overdetermined elliptic in OPS} 5. 
ja|=2 


As in (17.12), we have FE € OPS; ; such that 
(17.72) EM#(xz,D)=I+Ro, Ro € OPS-~. 
Meanwhile, (17.66) gives 


(17.73) M*(ax,D)u=g — B(x,Du)— > Ab(x, D)d°u=h. 
lo|=2 
Hence, mod C™, 


u = Eg— EB(«,Du)—E S~ Ab(x,D)a° 


|a|=2 


(17.74) é Cor ris aa A. Gre 
=CP, p=min(s+2,r+1). 


This is an improvement over the hypothesis u € C7, and one can iterate this 
argument until the desired conclusion in (17.67) is obtained. 


Combining techniques used to prove Proposition 4.9 with those used above, 
we have the following variant of Theorem 17.9. 


Proposition 17.10. Consider the overdetermined system 


(17.75) S50; Asn (a, Uw) Opu + Q(x,u, Vu) =g on 2. 
j,k 


Assume Ajk and Q are smooth in their arguments, that 


(17.76) u€H'4, gq>n, 
and that 
(17.77) |Q(x,u, Vu)| < C(u)(Vu)?. 


Also assume (17.75) is overdetermined elliptic at u, i.e., 


(17.78) S- Ajn (a, u)E;Ex is injective, for € # 0. 


ik 


Then, given p € (q,0o), s > —1, 
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(17.79) g¢ H®? ue He, 
Here is another variant. 


Proposition 17.11. Consider the overdetermined system 


(17.80) Sj Aj (a, u, Vu) =gonQ. 
J 


Assume each Aj is smooth in its arguments and 
(17.81) ger, PSH, 
Also assume (17.80) is overdetermined elliptic at u. Then 


(17.82) g€ H*?, s>-1,l<p<o=ue H*??, 
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Proof. Parallel to the proof of Theorem 17.1, we can take 6 € (0,1) and write 


(17.83) S° 0; A;(2,u, Vu) = 5° d;MFut >) 0;Meu+ R, 


J J J 
with R € C™ and, foru € C!t+", r > 0, 
(17.84) M? €OPSis, M?eOPst;”. 


As in (17.7), the hypothesis that (17.80) is overdetermined elliptic implies 


(17.85) S- a; Mf is overdetermined elliptic, 

so we have 

(17.86) Ee€OPS;3, E)_\d;M? =I mod OPS-@, 
j 


and (17.80) yields (mod C™) 


(17.87) u=Eg- EY" 0;MPu. 
j 


Now 


ue Citt => MPue crt 


(17.88) = EX 0;MPu E ore. 
J 
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and 
(17.89) g ¢ H®? = Eg € He*??, 
Hence we have 
(17.90) the TPG rs, 
To proceed, assume in addition to (17.80)-(17.81), which leads to (17.84), that 
(17.91) ue Hera gh?, »>0,e>=1, 
Then 


M? € OPS1;" = Miu € Het rer 4 ptr 


> Oj Miu © Heth 4 Orttetrs 


(17.92) 
=> EY 0;Mju e Hst2trép ais Clee 
j 
NiO) 
(17.93) uUu= Eg an EX 0;Mpu € Het?P + Gite 


J 


Iterating this gives 
(17.94) u € Het 4 Citeth)) 6 WREN, 


hence u € H*+?-P, as asserted. 


We have used paradifferential operator calculus for the local regularity results 
of this section. It is of interest to compare a simpler approach, mentioned in 8J of 
the Appendix to [Bes]. Suppose u € C™*"(Q) solves (17.1), i-e., 


(17.95) F(a, Du) = g. 

The linearization at u is of the form 

(17.96) L(x, D)v = 4g, 

and if (17.95) is overdetermined elliptic at u, then (17.96) is overdetermined ellip- 
tic. To establish local regularity near zg € , [Bes] mentions one can freeze 


coefficients there, obtaining the constant coefficient operator L(D) = L(2o, D), 
which is overdetermined elliptic. Then 
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(17.97) L(D)* F(x, D™u) = G 


(with g = L(D)*g) is determined elliptic near xo, and one can obtain regularity 
results from those for determined elliptic PDE. This approach requires somewhat 
stronger a priori assumptions on the regularity of wu than is natural. In this respect 
the approach taken above has an important advantage. 

On the other hand, given that wu has sufficient a priori regularity (say wu € 
C?™*+<), then one can use (17.97). Here is one good use. Suppose the equation 
(17.95) has F' real analytic in its arguments (and g real analytic), and also overde- 
termined elliptic. Then results on analytic regularity established in [Mor2] apply 
to (17.97), to yield analytic regularity of u. Now, one can have the best of both 
worlds by first applying results like Theorem 17.1 to yield C°° regularity for wu 
under less stringent hypotheses, and then applying the analytic regularity results 
of [Mor2] to (17.97). 

The paper [Bry] has used this to good effect. This work belongs to a gen- 
eral area known as the theory of exterior differential systems, an area introduced 
by E. Cartan and given a modern presentation and substantial development in 
[BCG3]. It encompasses a broad class of overdetermined systems of nonlinear 
PDE, with many connections to differential geometry. A key ingredient in the 
study is an extension of the Cauchy-Kowalewski theorem known as the Cartan- 
Kahler theorem. In this setting, it is essential to work with real analytic equations 
and solutions. Thus it is important to have analytic regularity results, and useful 
to have them for solutions whose a priori regularity is not too strong. 


A. Morrey spaces 


Given f € Lii.(R”), p € [1, 00), one says f € M?(R”) provided that 


(A.1) R- i lf(x)| dx << CR-"/?, 


Br 


for all balls Bp of radius R < 1 in R”. More generally, if 1 < q < p and 
f € Li,(R”), we will say f € M?(R") provided that, for all such Br, 


loc 


(A.2) R / |f(a)|4 dx < CR-"4?, 


Br 


The spaces M?(IR") are called Morrey spaces. If we set dr f(x) = f(Rx), the 
left side of (A.2) is equal to [ p, \Orf(x)|7 daz, so an equivalent condition is 


(A.3) Sef lla.) < CR”, 
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for all balls By of radius 1, and for all R € (0,1). It follows from Hélder’s 
inequality that 


Dipig(R”) = MP(R") Cc M?P(R") Cc M?(R"). 


We can give an equivalent characterization of M/” in terms of the heat kernel. 
Let p,(€) = e7!"§I", Then, given f € L1.,,(R”), 


unif 
(A.4) f € M?(R") = p,(D)|f| << Cr-””, 


for 0 < r < 1. To see the implication >, given x € R” write f = fi + fo, 
where f; is the restriction of f to the unit ball B,(a) centered at x, and fo is the 
restriction of f to the complement. That p,(D)|fi|(x) < Cr-"/”, for r € (0, 1], 
follows easily from the characterization (A.1) and the formula 


pr(D)62(y) = (4mr?)—/? em le-P/4r* 


while this formula also implies that p,(D)| f2|(x) is rapidly decreasing as r \, 0. 
The implication < is similarly easy to verify. Note that 


(A.5) f satisfies (A.4) => |p,(D)f| < Cr7"/?. 


Recall the Zygmund spaces C7 (IR”), r € R, introduced in § 8 of Chap. 13, 
with norms defined as follows. Let Uo(€) € C§°(R”) be equal to 1 for |€| < 1, 


set UW, (£) = Wo(2-*E), and let dy (€) = Wa(E) — Vy (€). The set (a, (E)} isa 
Littlewood—Paley partition of unity. One sets 


(A.6) IIf| 


cep 2°" be (D) fllz~- 


For r € (0,00) \ Z*, C® coincides with the Hélder space C’, and C} is the 
classical Zygmund space. As shown in Chap. 13, one has, for all m, r € R, 


(A.7) P € OPST, => P:CT 3 Cl. 


The following relation exists between Zygmund spaces and Morrey spaces. 
From (A.4)-(A.5) we readily obtain the inclusion 


(A.8) M?(R") c Cy"/?(R"). 
From this we deduce a result known as Morrey’s lemma: 


Lemma A.1. [fp > n, then, for f € S’(R"), 
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(A9) Vf eM?(R")—> feCt(R"), r=1- 7 E (0,1). 
Proof. We can write 
(A.10) f= 52 Bi (Ost) +Rf, B;¢OPS\(R"), R¢ OPS-~(R"). 
j=l 
Then (A.7)-(A.8) imply that B; 0; f € CY(R"), if the hypothesis of (A.9) holds. 
If Q C R” is a bounded region, we say f € M?(Q) if fe M?(R"), where 


f(x) = f(a) for x € Q, 0 for « ¢ Q. If BQ is smooth, it is easy to extend (A.9) 
to the implication (for p > 7): 


(A.11) Vfe MP9) > feo"), Pat (0, 1), 


via a simple reflection argument (across 0). 

One also considers homogeneous versions of Morrey spaces. If p € (1,00) 
andl <q<p, f € Li,(R”), we say f € M?(IR”) provided (A.2) holds for all 
R € (0,00), not just for R < 1. Note that if we set 


1/ 
(A.12) Ifllang =sup R°/(R-” f |f(o)|" ax) 
R Ba 


where R runs over (0,00) and Br over all balls of radius R, then 


(A.13) lor flue =r”? |Iflle, 


where 6,.f(«) = f(ra). This is the same type of scaling as the L?(IR”)-norm. 
It is clear that compactly supported elements of M?(IR") and of Mf (IR”) coin- 
cide. In a number of references, including [P], Mi is denoted £,,,, with A = n 


(1 —q/p). 


The following refinement of Morrey’s lemma is due to S. Campanato. 


Proposition A.2. Given p € [1,00), s € (0,1), assume that u € L?,(IR") and 
that, for each ball Br(x) with R <1, there exists a € C such that 


(A.14) ii |u(y) — al? dy < CR"tPSs, 
Br(a2) 


Then 


(A.15) u € C2.(R"). 
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Proof. Pick y € C§°(R”) to be a radial function, supported on |x| < 1, such that 
@(E) > 0, and let = Ay, so f 7 dx = 0. It suffices to show that 


(A.16) \(br*u)(x)|<CR*, R<1, 


where p(x) = R-"y(R-1 x). Note that, for fixed z, R, a = a(Br(z)), we 
have 


(A.17) (wr *u)(x) = pr * (u—a)(2), 


|r *u)(2)| 


$< ll¥allzo' (ap (yllu — allee(Ba(e)) 


(A.18) 1/p’ 1/p 
< ( Ro"? |b(Ro*y)|? i) ( if |u(y) — al? iy 
B 


Br(0) R(x) 


ZO RRP . RHP Ria Re. 


as desired. 


B. Leray—Schauder fixed-point theorems 


We will demonstrate several fixed-point theorems that are useful for nonlin- 
ear PDE. The first, known as Schauder’s fixed-point theorem, is an infinite 
dimensional extension of Brouwer’s fixed-point theorem, which we recall. 


Proposition B.1. Jf kK is a compact, convex set in a finite-dimensional vector 
space V, and F': K — K is acontinuous map, then F has a fixed point. 


This was proved in § 19 of Chap. 1, specifically when K was the closed unit 
ball in R”. Now, given any compact convex Kk C V, if we translate it, we can 
assume 0 € KK’. Let W denote the smallest vector space in V that contains K’; say 
dimg W = n. Thus there is a basis of W, of the form F C Kk. Clearly, the convex 
hull of & has nonempty interior in W. From here, it is easily established that K 
is homeomorphic to the closed unit ball in R”. 

A quicker reduction to the case of a ball goes like this. Put an inner product 
on V, and say a ball B C V contains kK. Let ~ : B — K map a point z to the 
point in Kk closest to x. Then consider a fixed point of Fow: Bo KCB. 

The following is Schauder’s generalization: 


Theorem B.2. [f K is a compact, convex set in a Banach space V, and 
EF’: K > K is acontinuous map, then F has a fixed point. 
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Proof. Whether or not V has a countable dense set, K certainly does; say {v; : 
j € Z*} is dense in K. For each n > 1, let V,, be the linear span of {v1,..., Un} 
and K,, C K the closed, convex hull of {v,,...,v,}. Thus K,, is a compact, 
convex subset of V,,, a linear space of dimension < n. 

We define continuous maps Q,, : K — K,, as follows. Cover K by balls of 
radius 6, centered at the points vj, 1 < 7 < n. Let {ynj :1< 7 < n}bea 
partition of unity subordinate to this cover, satisfying 0 < y; < 1. Then set 


(B.1) Qn(v) = >) ong (v)vj, Qn: K > Ky. 
j=l 


Since Ynj(v) = 0 unless ||v — v,|| < dn, it follows that 
(B.2) Qn(v) — v|| < bn. 


The denseness of {v; : j € Z*} in K implies we can take 6,, + 0 as n + oo. 
Now consider the maps F,, : K, — Ky, given by F, = Q, 0 Fi. . By 
Proposition B.1, each F;, has a fixed point x, € K,. Now . 


(B.3) QnF (an) = Un => ||F (an) — tn|| < On- 


Since K is compact, (x,,) has a limit point « € K and (B.3) implies F(x) = 2, 
as desired. 


It is easy to extend Theorem B.2 to the case where V is a Fréchet space, using 
a translation-invariant distance function. In fact, a theorem of Tychonov extends 
it to general locally convex V. 

The following slight extension of Theorem B.?2 is technically useful: 


Corollary B.3. Let E be a closed, convex set in a Banach space V, and let F : 
E — E bea continuous map such that F(E) is relatively compact. Then F' has 
a fixed point. 


Proof. The closed, convex hull K of FE) is compact; simply consider F | 7: 
which maps K to itself. 


Corollary B.4. Let B be the open unit ball in a Banach space V. Let F : B + V 


be a continuous map such that F(B) is relatively compact and F (OB) C B. Then 
F has a fixed point. 


Proof. Define a map G : B + B by 


Ge) =Fle) if |F@I <1 G@%= oo 
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Then G : B + B is continuous and G(B) is relatively compact. Corollary B.3 
implies that G' has a fixed point; G(a) = x. The hypothesis F(OB) C B implies 
|z|| < 1, so F(a) = G(x) = 2. 


The following Leray-Schauder theorem is the one we directly apply to such 
results as Theorem |.10. The argument here follows [GT]. 


Theorem B.5. Let V be a Banach space, and let F : [0,1] x V > V bea 
continuous, compact map, such that F(0,v) = vo is independent of v € V. 
Suppose there exists M < oo such that, for all (a,x) € [0,1] x V, 


(B.4) F(o,2) =a% => |2|| < M. 
Then the map F, : V + V given by F\(v) = F(1,v) has a fixed point. 


Proof. Without loss of generality, we can assume vg = 0 and M = 1. Let B be 
the open unit ball in V. Given < € (0, 1], define G. : B + V by 


1 lll] _# 
a 
e |lal| 


F(1, . ) if ||x|| <1—e. 
l-e 


Ga(e) = F ( ) if L—e<|la|| <1, 


Note that G-(OB) = 0. For each « € (0, 1], Corollary B.4 applies to G-. Hence 
each G, has a fixed point x(¢). Let x, = x(1/k), and set 


. 1 
on =k(1— axl) if 1-7 < [lee] <1, 
. 1 
1 if |laz|| <1-—-<, 


so ox € (0,1] and F(ox,r2~) = xp. Passing to a subsequence, we have 
(o%,%) — (o,2x) in [0,1] x B, since the map F is compact. 

We claim o = 1. Indeed, if 7 < 1, then ||a,|| > 1 — 1/k for large k, hence 
||x|| = 1 and F(o, x) = «, contradicting (B.4) (with M = 1). Thus o;, > 1 and 
we have F'(1, x) = a, as desired. 


There are more general results, involving Leray-Schauder “degree theory,” 
which can be found in [Schw, Ni6, Deim]. 


C. The Wey] tensor 


We recall that if J/ is an n-dimensional Riemannian manifold, with metric tensor 
(gjx), in local coordinates, then the connection 1-form I is given by 
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(C.1) My = 50°" (850m + O69jm — Om9o;); 
and the Riemann curvature tensor R = (R%y;x) is given by 
(C.2) R=al+Tar, 
leading to the Ricci tensor and scalar curvature 
(C.3) Rico, = Rojn, Ric’, = g?’Ricyr, S$ = Ric? ;. 


We are interested in these objects when our metric tensor has limited smooth- 
ness, and in that regard recall that 


(C.4) Gk E C? => Re ojk, Ricox, Ric? Ke C°. 
Going further, 


gin € CON HH? = Rain, Rico, € H-'? +L, 


C5 ; 
ce?) Ric’ x, Se HY, Vp <- 
nee 


The stated results on R*o5K and Ricy, hold because we have ['%,; € L?, and the 
stated results on Ric? ;, and S hold because pointwise multiplication of functions 
extends to a continuous bilinear map 


(C.6) HY? x Ho? 4 HP, vp < 
7 


as follows by duality from H!? x H!? > H':?, Vp > n. Furthermore, 
(C.7) —-gjx € Hl, p> n => Rsjx, Ricox, Ric? ,, S € H-)”, 

since 

(C.8) AH? x HYP", 1?" hence HY? x HH” +H” Yp>n. 


We now seek formulas for how these tensors change if we replace the metric 
tensor g;; by a conformal deformation, 


(C.9) Fix = Cn giK. 


Out of this calculation the Wey! tensor will arise, as an invariant tensor. For the 
new metric tensor, we have g/* = e~?“g/", and 


(C.10) Ts; = My; + O° yu; + O° Up = 959° Ue. 
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Here we set 
(C.11) Uy = O;u, Ajr = Uj;k — UjUk, 


where the covariant derivative is taken using *);, the connection form for g;x. 
Then the new Riemann tensor satisfies 


R' vjx = yp + Og Agy — Op Any 
(C.12) + 9" (gv; Ack — Gok Ac;) 


+ (6% 4905 — 9°; 90K) go" uel 


If gjx,u € C”, then g,,, € C” and (C.4) holds for Re yjky Ric jk, Ric’ ks and S. If 
gjksu € CON H*”, then g,, € CN H*? and (C.5) holds for R’ ojks etc. We 
have 


(C.13) U,gik € CON H'? => uj € L*, Aj e HP +L, 


and in such a case (C.12) is well defined and valid, with both sides in H —1LP" for 
each p’ < n/(n — 1). Contracting (C.12) yields 


(C.14) Ricj, = Ricjx +(n — 2) Ajx + gjx(Au + (n — 2)|dul?), 


where A denotes the Laplace-Beltrami operator and |du|? = g!"ugtm € L?. 
Raising indices gives 


(C.15) Ric’, = 672" [9° Rice, +(n — 2)g? Agcy + 574(Au + (n — 2)|dul?)). 
Here we must be careful to define 


(C.16) e- gi" Rico, em 2% gh? Ag, C HO”, Vp < a 
n— 

under the hypotheses of (C.13), by first forming e~?"g!" € C°M H!? and then 

using (C.6). Similarly we have 


(CAT eM Au = (47); (712g Ogu) © HOO, 


for all p’ < n/(n — 1), under such hypotheses, when 7 = det(g;x), and we have 
e %4y—1/2 © °K Hh? and 71/2g3* € Con H'?. Since passing from (C.14) 
to (C.15) seems to force taking products in the opposite order we note that (C.15) 
can be justified for general g;, € C° 1 H' (with the interpretation above) by 


taking smooth ys 
similarly for u‘”) — u, and passing to the limit. 
Next, contracting (C.15) gives 


approximating g,;; in the C°-norm and the H':?-norm, and 
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(C18) S=S,+(n—2)e 9" Ay; + ne Au + n(n — 2)e—™|dul?. 


Here, assuming g;x € C’ ° ~ H"?, the second and third terms on the right side of 
(C.18) are given by (C.16)-(C.17), the fourth belongs to L!, and 


(C.19) Sy = (e~2"gi*) Ricj, € HO”, Vp' < re 
he 

If we hypothesize more smoothness on u and g;;, we can write 

(C.20) be 8. 

For example, this holds provided 


(C21) u,gjk € H'?, with p>n, 


which we assume henceforth. Note that 


(C.22) g Ao; = g*uj.2 — g*ujue = Au — |dul?, 
so we have 
(C.23) S =e "($+ (2n — 2)Au+ (n— 1)(n — 2)|dul?]. 
Using (C.14) and (C.23) together, we have 
(C.24) 
— 2. & . 1 a 2 2 
Ricj, — Fn gk? = Rizr bn sik? t 5 gn \du| + (n— 2)Ajx, 


and similarly, in place of (C.15), 


i ,S 


1 


Ric’, — >—>6,59 = e™) Ric! ka 
(C.25) 2 2 2n—2 


—2.. ; 
ae 5H eldul? + (rn — 2)g%* Ace) 
These formulas imply 
(C.26) W ijk = Wigs 
where W* ijk 18 the Weyl tensor, defined by 
(C.27) 
1 
Wisk = R ijk + =o (5°; Rici, —6, Ric;; +Gik Ric! ; — Gij Ric? x] 


S oo ge 
+ ao Dmaa 0? #9 O° 79ik)- 
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This gives the following necessary condition for a metric tensor to be conformally 
flat, derived by Wey] (at least for C? smooth metrics): 


Proposition C.1. Assume n > 3 and gj, € H'”, p > n. For there to exist 
u € H! such that the metric tensor jk = e7"g;, has vanishing Riemann 


>! ke 
tensor, R jx, = 0, it is necessary that Wisk = 0. 


Proof. By (C.27) (or its analogue for W isn), it is clear that Re k = 0 implies 
W ae = 0, so the result follows by (C.26). 


The converse, for C? metric tensors, was proved by Schouten, with the follow- 
ing classical result. 


Proposition C.2. Assume g;x € C3, and n > 4. If Wisk = 0, then there exists 
u € C® such that jk = e?" 95, has vanishing Riemann tensor. 


It turns out that 
(C.28) n=3=> W'ijx = 0. 
The classical result in this case is the following. 
Proposition C.3. Assume gjp € C3 and n = 3. If the Cotton tensor di;;; van- 
ishes, then there exists u € C® such that Gin = e?"gi,, has vanishing Riemann 


tensor. 


We describe the Cotton tensor. For this, we elevate our smoothness hypothesis 
on the metric tensor to 


(C.29) 93k € C'NH*4, qe (1,00), 

which implies [%,; € C° 0 H'4, hence 

(C.30) Ry5x, Ricjn, Ric? ,, S, W ign € L4. 

We can take the covariant derivatives of these tensor fields, obtaining 
(C.31) Ruin € HO, 

and so forth. In this setting, we continue to have the Bianchi identity 
(C.32) RR yijk + Ro ojea + Ro oKig = 0. 


From (C.32) and (C.27) we obtain 
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(C.33) 
W? gre + W" jee, + Wiese 
1 


where %;;, is the Cotton tensor, given in terms of the Schouten tensor 


: S 
(C.34) Pir = Ricjx oy 2 Jak 
by 


As usual, pve = g™ Sue. Parallel to (C.31), we have 
(C.36) Lajn, Dap, Ged” ye € HO, 


under the hypothesis (C.29). Use of the Bianchi identity yields 


(C.37) W iiee = nao Di. 

In particular, 

(C.38) forn>3, Wiz, =0> Dijz = 0. 
We also have 

(C.39) for = 3, Vigg = Dige: 


The following provides a preliminary step towards establishing Propositions 
C.2-C.3. 


Proposition C.4. Assume g;x € H'”, p > n, and assume Wisk = 0. Further- 
more, assume there exists u € H'? satisfying the overdetermined system 


1 ( SO5k 


1 . 
(C40) tay = yyy — S.9juldul? + —5 (8 — Ricje). 


‘ = 
Then Gin = eg. has Riemann tensor R 4; = 0. 


Proof. From (C.24) we see that Ricjx = 0 when (C.40) holds. Applying (C.27), 


with Wisk replaced by Wes which, by (C.26), also vanishes, we have Rag = 
0. 
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We can reduce (C.40) to a first-order overdetermined system for a 1-form, 
namely 


(C41) Ujsk = VjUk — aie += , 5 (us, = Ricjx). 
In fact, suppose (C.41) is satisfied, with vu; € L? (p > n). Since the right side of 
(C.41) is symmetric in j and k, this implies the 1-form 5° v;dz; is locally exact, 
equal to du with u € H1? solving (C.40). 

The classical result is that, when gj, € C®%, the vanishing of the Weyl tensor 
and the Cotton tensor provides the integrability condition for the system (C.41). 


We state this formally. 


Proposition C.5. Assume gjx% € C®. If either n = 3 and Lijk = Oorn > 3 and 
Wisk = 0, then there exists a local C? solution to the system (C.41), hence a C® 


; - . =e 
solution u to (C.40), so 95k = e495, has Riemann tensor R 4;, = 0. 


The proof involves an elaboration of arguments used for Frobenius’s theorem. 
See [Ei] for details. If gj, € C™ for some integer m > 3, the argument yields 
u € C™. Once one has a vanishing Riemann tensor (hence a vanishing Ricci 
tensor), one can use local harmonic coordinates (for 9; ;), in which the metric 
tensor is C°°. Then Proposition 3.1 of Appendix B (Connections and Curvature) 
yields a coordinate system for which g,,, 1s constant in each patch. 

Arguments involving real Frobenius theorems seem not so effective for metric 
tensors with substantially less regularity than hypothesized in Proposition C.5. In 
§17 we describe work of [LiS2], making use of local n-harmonic coordinates, in 
which the Weyl tensor is seen to provide an overdetermined elliptic system for 
the conformally rescaled metric tensor. This leads to a substantial extension of 
Proposition C.2, to the case of a C” metric tensor, r > 1, with vanishing Weyl 
tensor. 
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Nonlinear Parabolic Equations 


Introduction 


We begin this chapter with some general results on the existence and regularity of 
solutions to semilinear parabolic PDE, first treating the pure initial-value problem 
in § 1, for PDE of the form 


(0.1) a = Iu+Fi(t,z,u,Vu), u(0) =f, 


where wu is defined on [0,7) x M, and M has no boundary. Some of the results 
established in § | will be useful in the next chapter, on nonlinear hyperbolic equa- 
tions. We also give a precursor to results on the global existence of weak solutions, 
which will be examined further in Chap. 17, in the context of the Navier-Stokes 
equations for fluids. 

In §2 we present a useful geometrical application of the theory of semilin- 
ear parabolic PDE, to the study of harmonic maps between compact Riemannian 
manifolds when the target space has negative curvature. 

In § 3 we extend some of the results of § 1 to the case OM # 0), when boundary 
conditions are placed on w. Section 4 is devoted to the study of reaction-diffusion 
equations, of the form 


Ou 
(0.2) ap = but X(u), 


where u takes values in R‘ and X is a vector field on R*. Such systems arise in 
models of chemical reactions and in mathematical biology. One way to analyze 
the interplay of diffusion and the reaction due to X(u) in (0.2) is via a nonlinear 
Trotter product formula, discussed in § 5. 

In 86 we examine a model for the melting of ice. The source of the nonlin- 
earity in this problem is different from those considered in §§ 1-5; it is due to the 
equations specifying the interface where water meets ice, a “moving boundary.” 
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In §§7-9 we study quasi-linear parabolic PDE, beginning with fairly 
elementary results in § 7. The estimates established there need to be strengthened 
in order to be useful for global existence results. One stage of such strengthening 
is done in § 8, using the paradifferential operator calculus developed in § 10 of 
Chap. 13. We also include here some results on completely nonlinear parabolic 
equations and on quasi-linear systems that are “Petrowski-parabolic.” 

The next stage of strengthening consists of Nash—Moser estimates, carried out 
in § 9 and then applied to some global existence results. This theory mainly applies 
to scalar equations, but we also point out some & x & systems to which the Nash— 
Moser estimates can be applied, including some systems of reaction-diffusion 
equations in which there is nonlinear diffusion as well as nonlinear interaction. 


1. Semilinear parabolic equations 


In this section we look at equations of the form 


(1.1) ot = bu + F(t,2,4, Vu), u(0) = f, 

for u(t, x), a function on [0,7] x M. We assume WM has no boundary; the case 
OM # @ will be treated in §3. Generally, L will be a second-order, negative- 
semidefinite, elliptic differential operator (e.g., L = vA), where A is the Laplace 
operator on a complete Riemannian manifold MM and v is a positive constant. We 
suppose F’ is C'°° in its arguments. 

We will begin with very general considerations, which often apply to an even 
more general class of linear operators L. For short, we suppress (t, x)-variables 
and set 

®(u) = F(u, Vu). 


We convert (1.1) to the integral equation 


(1.2) u(t) = ef + a e—9)/6(u(s)) ds = Vu(t). 
0 


We want to set up a Banach space C'([0, T], X ) preserved by the map W and estab- 
lish that (1.2) has a solution via the contraction mapping principle. We assume that 
f © X, a Banach space of functions, and that there is another Banach space Y 
such that the following four conditions hold: 


(1.3) e” : X + X is a strongly continuous semigroup, for t > 0, 
(1.4) ® : X — Y is Lipschitz, uniformly on bounded sets, 
(1.5) eve ¥ =X, fort > 0, 


and, for some y < 1, 


1. Semilinear parabolic equations 337 
(1.6) len“ Ilea.x) < Ct, fort € (0, 1]. 


We will give a variety of examples later. Given these conditions, it is easy to see 
that W acts on C((0, 7], X), for each T > 0. Fix a > 0, and set 


(1.7) Z= {ue O([0,T], X) : u(0) = f, lu) — filx < a}. 
We want to pick T’ small enough that YW : Z — Z is a contraction. By (1.3), we 
can choose 7} so that |le’“ f — f||x < a/2 fort € [0,T;]. Now, if u € Z, then, 


by (1.4), we have a bound ||®(u(s))||y < Ay, for s € [0, Ti], so, using (1.6), we 
have 


t 
(1.8) I / el I/O (u(s)) ds 
0 


te OM mee 
x 


If we pick T, < T, small enough, this will be < a/2 for t € [0,72]; hence 
W:Z— Z, provided T < Tb. 
To arrange that W be a contraction, we again use (1.4) to obtain 


||®(u(s)) — ®(v(s)) lly < Kllu(s) — v(s)|Lx, 
for u,v € Z. Hence, for ¢ € [0, Ta], 


IN) — VOVOx =] [et [B(s)) - B(e(9))] as. 


< Crk sup ||u(s) — v(s)||x; 


(1.9) 


and now if T’ < T> is chosen small enough, we have Cy < 1, making Va 
contraction mapping on Z. Thus W has a unique fixed point wu in Z, solving (1.2). 
We have proved the following: 


Proposition 1.1. [f X and Y are Banach spaces for which (1.3)-(1.6) hold, then 
the parabolic equation (1.1), with initial data f € X, has a unique solution u € 
C([0,T], X), where T > 0 is estimable from below in terms of ||f \|.x- 


As an example, let M be a compact Riemannian manifold, and consider 
(1.10) xX=C'(M),. Y=C(Mt). 
In this case we have the conditions (1.3)—(1.6) if L = A. In particular, 
(1.11) lle“AIlecaoy < Ct, fort € (0,1). 
Thus we have short-time solutions to (1.1) with f € C1(M). 


It will be useful to weaken the hypothesis (1.3) a bit. Consider a pair of Banach 
spaces X and Z of functions, or distributions, on /, such that there are continu- 
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ous inclusions 
Cxo(M) CX CZCD'(M). 


We will say that a function u(t) taking values in X, for t € I, some interval in R, 
belongs to C(I, X) provided u(t) is locally bounded in X, and u € C(I, Z). More 
generally, this defines C(I, X) for any locally compact Hausdorff space I. Then 
we say e'” is an almost continuous semigroup on X provided e'” is uniformly 
bounded on X for t € [0,7], given T < oo, e&t94u = eeu, for each 


u € X, s,t € [0, oo), and 
ue X = eu €C((0,00), X). 


Examples include e’“ on L®(M) and on Holder spaces C'(M), r € R* \ Zt, 
when M is compact. The space C(I, X) may depend on the choice of Z, but 
we omit reference to Z in the notation. For example, when we consider e’* on 
L°(M), with M compact, we might fix p < oo and take Z = L?(M). 

The proof of Proposition 1.1 readily extends to the following variant: 


Proposition 1.1A. Let X and Y be Banach spaces for which (1.4)-(1.6) hold. In 
place of (1.3), we assume e‘” is an almost continuous semigroup on X. Also, we 
augment (1.4) with the condition that ® : C(I, X) — C(I, Y). Then the initial- 
value problem (1.1), given f © X, has a unique solution u € C([0,T], X), where 
T > 0 is estimable from below in terms of || f || x. 


As examples, we can consider 
(1.12) A=C(mM), YHOU, 
r > 0. If r is not an integer, these are Hélder spaces. We have, for any s > 0, 
(1.13) lle" leerorey < C8", Wes 1. 
It follows from (1.2) that if f € C”*+! and one has a solution u in the space 
C([0, T],C"**), then actually, for each t > 0, u(t) € C™t® for every s < 2. 
We can iterate this argument repeatedly, and also, via the PDE (1.1), obtain the 


regularity of t-derivatives of u, proving: 


Proposition 1.2. Given f € C'(M), L = A, the equation (1.1) has, for some 
T > 0, a unique solution 


(1.14) u € C([0,T],C'(M)) NC™((0,T] x M). 
A number of different pairs X and Y can be constructed; it is particularly of 


interest to have results for cases other than X = C!(M), Y = C(M), as these are 
often useful for establishing the existence of global solutions. When (1.4) holds 
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depends on the nature of the nonlinearity in (1.1). We list here some estimates that 
bear on when (1.6) holds, in case L = A. The bound in the right column is on the 
operator norm over 0 < t < 1. In the cases listed here, we assume that p > q, and 
S>r. 


(1.15) ¥ xX bound on |le’“|| ¢(y,x) 
L1(M) L?(M) Cr /2)0/a-1/2), 
H™?(M) H*?(M) Gi ae 
A" 4(M) H*?(M) Ct (n/2)(/q-1/p)—(1/2)(s—r), 


We now take a look at the case F(u, Vu) = >), 0jFj(u) of (1.1), with L = 
vA; that is, 


a 
(1.16) 7 = ee rate) u(0) = f. 


For simplicity, we take 14 = T”. The limiting case v = 0 of this, which we 
will consider in § 5 of the next chapter, includes important cases of quasi-linear, 
hyperbolic equations. We will assume each F; is smooth in its arguments (w can 
take values in R* ) and satisfies estimates 

(1.17) |Fi(u)| < Clu), |VEj(u)| < C(u)?}, 

for some p € [1, 00). We will show that the Banach spaces 

(1.18) X = L1(M), Y = H-14/?(M) 

satisfy the conditions (1.3)—(1.6) for a certain range of q. First, we need g > p, so 
q/p > 1 in (1.18). Only (1.4) and (1.6) need to be investigated. For (1.4) we need 
P,: Lt L4/® to be locally Lipschitz. To get this, write 


Fy(u) — Fj(v) = G5(u, v)(u— v), 


(1.19) 1 
G;(u, v) =| F;(su+(1—s)v) ds. 
0 


By (1.17), we have an estimate on ||Gj(u, v)||,a/~@-1, and, by the generalized 
Holder inequality, 

(1.20) Fi(u) — Fi) Mzae S [G5llna@—-n |lu — vllze, 

so we have (1.4). To check (1.6), we use the third estimate in (1.15), to get 


(1.21) let Ilear-nare,nay < Ct /2@/a-V/a)-1/2, 
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for 0 < t < 1, so we require n(p — 1)/q < 1. Therefore, we have part of the 
following result: 


Proposition 1.3. Under the hypothesis (1.17), if f © L4(M), the PDE (1.16) has 
a unique solution wu € C([0,T], L9(M)), provided 


(1.22) q>p and q>n(p—1). 
Furthermore, u € C™°((0, 7] x M). 


It remains to establish the smoothness. First, replacing LY by L% in (1.21), we 
see that, for any t € (0, 7], u(t) € L™ for alla < ¢/(p—q/n). Asp—q/n <1, 
this means q; exceeds q by a factor > 1. Iterating this gives u(t) € L%, where 
qj exceeds q;_, by increasing factors. Once you have q; > np, the next iteration 
gives u(t) € C”™(M), for some r > 0. Now, consider the spaces 


(1.23) X=O"(M), Y=H?-!-*4(M), 


where q is chosen very large, and ¢ > 0 very small. The fact that u +> F;(u) 
is locally Lipschitz from C"(M) to C™(M), hence to H”~*-?(M), gives (1.4) 
in this case, and estimates from the third line of (1.15), together with Sobolev 
imbedding theorems, give (1.6), and furthermore establish that actually, for each 
t > 0, u(t) € C™(M), for r1 — r > 0, estimable from below. Repeating this 
argument a finite number of times, we obtain u(t) € C'/(M), with r; > 1. At 
this point, the regularity result of Proposition 1.2 applies. 
We can now establish a global existence theorem for solutions to (1.16). 


Proposition 1.4. Suppose F; satisfy (1.17) with p = 1. Then, given f € L?(M), 
the equation (1.16) has a unique solution 


(1.24) u € C([0, 00), L?(M)) NC™((0,00) x M), 
provided, when u takes values inR*, F;(u) = (F1,(u),...,F*;(u)), that 


(1.25) 2 ue ile 1<i,k<K. 
Ou; Our 


Proof. We have u € C({0,7], £7) 9 C™°((0,T) x M), since (1.22) holds with 
q = 2. To get global existence, it therefore suffices to bound ||u(t)||z2; we prove 
this is nonincreasing. Indeed, for t > 0, 


© \u(é)|Re = 2(u(t), > O;F;(u(t))) — 2v||Vu(e)|lz2 


(1.26) dt 
< 2(u(t), $> Oj) Fj(u@)). 
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Now by (1.25) there exist smooth G; such that Pe, = 0G; /Oux, and hence the 
right side of (1.26) is equal to 


(1.27) 2 f ae,(w du = 0. 


The proof is complete. 


The hypothesis (1.25) implies that the v = 0 analogue of (1.16) is a symmetric 
hyperbolic system, as will be seen in the next chapter. 

The condition p = 1 for (1.17) is rather restrictive. In the case of a scalar equa- 
tion, we can eliminate this restriction, at least for bounded initial data, obtaining 
the following important existence theorem. 


Proposition 1.5. If (/.16) is scalar and f € L°°(M), then there is a unique 
solution 


u € L*((0,00) x M) NC (0,00) x M), 
such that, ast \, 0, u(t) > f in L?(M) for all p < ow. 


Proof. Suppose || f||p-° < M. Alter F;(u) on jul > M + 1/2, obtaining F;(u), 
constant on u < —M —1andonu > M +1. Then Proposition 1.4 yields a global 
solution u to the modified PDE. This u solves 


(1.28) ou =vAut ) a;(t,2)dju, a;(t,x) = Fi(u(t,2)), 


so the maximum principle for linear parabolic equations applies; ||w(t)|| 2 is 
nonincreasing. Thus ||u(t)||;0 < M for all t, and hence wu solves the original 
PDE. 


The solution operator produced from Proposition 1.5 has an important 
L*-contractive property, which will be useful for passing to the v = 0 limit 
in § 6 of the next chapter. We present an elegant demonstration from [Ho]. 


Proposition 1.6. Let u; be solutions to the equation (1.16) in the scalar case, 
with initial data u;(0) = f; € L°°(M). Then, for each t > 0, 


(1.29) \|ua(t) — ua(t)IInacmy S lft — fellzaqan- 


Proof. Set v = u; — ug. Then v solves 


a 
(1.30) Or = vAv + )~;[®;(ur, u2)v], 


with 


1 
®; (ui, U2) = | Fi (su; + (1 — s)uz) ds, 
0 


342 15. Nonlinear Parabolic Equations 


sO F;(u1) = F;(u2) = ®; (ur, we) (ur = ug). Set G(t, x) = ®; (ui, 2). Now, 
for given T > 0, let w solve the backward evolution equation 


3) 
(1.31) a = —vAw+ \°G;(t,2)0j;w, w(T) = wy € C%(M). 
Then w(t) is well defined for ¢ < T, and the maximum principle yields 
(1.32) || w(t) || Le < ||wollz~, for t — T. 


Note that ||v(Z)||z1 is the sup of (v(T), wo) over ||wol|z~° < 1. Now, fort € 
(0,7), we have 


<0, w) = (vAv,w) + >> (0;(G5r), w) 
— (v,vAw) + x, G;0;w) = 0. 


(1.33) 


Since (v(0), w(0)) < |]v(0)||z1||w(0)||L~, this proves (1.29). 


We next produce global weak solutions to (1.16), for K x K systems, with the 
symmetry hypothesis (1.25), in case (1.17) holds with p = 2. As before, we take 
M = T”. We will use a version of what is sometimes called a Galerkin method 
to produce a sequence of approximations, converging to a solution to (1.16). 

Give < > 0, define the projection P- on L?(M) by 


P.f(z)= > flke**, 


|k|<1/e 


where, fork € Z”, f (k) form the Fourier coefficients of f. Consider the initial- 
value problem 


Ouse 


(1.34) - 


= vP.AP-u- + P. S- 0;F;(Peuc), ue(0) = P-f. 


We take f € L?(M). For each ¢ € (0, 1], ODE theory gives a unique short-time 
solution, satisfying u-(t) = P-u_(t). Furthermore, 


d 
(1.35) ae Ilue(t) N72 = 2v(P-AP-ue, Ue) + 2) 0 (P.OjFj(Pette), Ue). 
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The first term on the right is —2v||VP-u-(t)||7.2 < 0. The last term is equal to 
25° (O;Fj(Petc), Pete) = —2 9° (F; (Pete), 0; Pete) 


(1.36) 7 2 f aj[e (Pette)] 


=0, 
where G'; is as in (1.27). We deduce that 

(1.37) llue(t)llz2 < [I fllze- 

Hence, for each ¢ > 0, (1.34) is solvable for all t > 0, and 

(1.38) {ue : € € (0, 1]} is bounded in (Rt, L?(M)). 


Note that further use of (1.35)-(1.37) gives 


T 
(1.39) 2v | |VPeue(t)|l72 dt = || Pe fll? — llue(T) Ize, 
0 
for any T € (0,00). Hence, for each bounded interval J = [0,7], since 
PeUe = Ue, 
(1.40) {u-} is bounded in L?(I, H*(M)). 


Given that |F(u)| < C(u)?, it follows from (1.38) that 


{F;(P-ue)} is bounded in L° (Rt, L'(M)) 


141 
oe ciM(Rt,A 4M), 


for each 6 > 0. Now using the evolution equation (1.34) for Ou, /Ot and (1.40)— 
(1.41), we conclude that 


(1.42) {ots } is bounded in L2(I, H~"/2-1-5(M)), 
hence 
(1.43) {uz} bounded in H'(I, H~"/?-!~°(M)). 


Now we can interpolate between (1.40) and (1.43) to obtain 


(1.44) {ue} bounded in H*(I, Ht~8("/2+149)()), 
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for each s € [0, 1]. Now if we pick s > 0 very small and apply Rellich’s theorem, 
we deduce that 


(1.45) {ue :0 <e <1} is compact in L*(I, H'~7(M)), 


for all y > 0. 
The rest of the argument is easy. Given T’ < oo, we can pick a sequence 
Up = Ue, Ek — O, such that 


(1.46) uz > u in L?((0,T],4'~7(M)), in norm. 


We can arrange that this hold for all T’ < oo, by a diagonal argument. We can also 
assume that u;, is weakly convergent in each space specified in (1.38) and (1.40), 
and that Ou, /Ot is weakly convergent in the space given in (1.42). From (1.46) 
we deduce 


(1.47) F;(P2,Ue,) 3 Fj(u) in L*((0,7], £*(M)), in norm, 
as k — oo, hence 
(1.48) 0; Fj (Pe,Ue,) 2 0;Fj(u) in L1((0,T], 47+" (M)). 


Using H-1!(M) c H~-"/?-1~9(M), we see that each term in (1.34) converges 
as €, — 0. We have proved the following: 


Proposition 1.7. If |F;(u)| < C(u)? and |VF;(u)| < C(u), then, for each f € 
L?(M), a K x K system of the form (1.16), satisfying the symmetry hypothesis 
(1.25), possesses a global weak solution 


u € L®(R*, L?(M)) L2.(R*, H*(M)) 


(1.49) 
A Lip, (R*, H-?(M) + H-"/2-1-9(M)). 


When reading the discussion of the Navier-Stokes equations in Chap. 17, one 
will note a similar argument establishing a classical result of Hopf on global weak 
solutions to that system. 


Exercises 


1. Verify the estimates on operator norms of e’“ : Y — X listed in (1.15). 
2. Show that, given f € C'(M), M =T", the solution to 


Ou 


a vAu+ F(t,u,Vu), u(0) = f, 


continues to exist as long as ||u(t)||z- does not blow up, provided this equation is 
scalar and F,, < 0. (Hint: Derive a PDE for uj; = Ou/Ox; and apply the maximum 
principle.) 
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3. Generalize the treatment of PDE (1.16), done above in the case M = T”, to the 
following situation for a general compact Riemannian manifold /: 
ou = vAu+ div F(u), 
where 
(a) u is scalar and F'(u) = F(a,u) € T,M, forx € M, wER, 
(b) u is a vector field and F(u) = F(a, u) € @°T,M, fora € M, ue Tr M. 
Consider other generalizations. 
4. More generally, extend the treatment of (1.16) to 


where D is a first-order differential operator on the compact Riemannian manifold 
M, and F satisfies the estimates (1.17). What additional properties must D have for 
Proposition 1.4 to extend? 

5. For given, ¢ > 0, k € Z*, consider the (4k)-order operator 


L=vA—cA”*. 


Show that ||e*” || cc, x) satisfies estimates of the form (1.15), where, in the right col- 
umn, one replaces t~7 by t77/?* and C depends on v and e€. 
6. Consider the PDE 
Ou 


(1.50) ap = VAuH cA**u+ S-d;Fi(u), (0) = f. 


Suppose F; satisfies (1.17), that is, 
(1.51) [Fi (u)| < C(uy?,  |VFj(u)| < C(u)P™. 
Show that there is a unique local solution 
u € C((0,T], L4(M)) nC™((0,T] x M) 
given f € L7(M), provided 


n(p — 1) 


>p and q> Pa). 
SSPE aed 


7. Suppose that (1.51) holds with p = 2 and that dim M = n < 8k — 2. Suppose 
also that the symmetry hypothesis (1.25) holds (for a K x K system), so F*;(u) = 
OG; /Oux. Given f € L?(M), show that (1.50) has a unique global solution u € 
C([0, 00), L?(M)) N C(O, 00) x M), and ||u(t)||z2 < || f|l?.2, for t > 0. 

8. Let u = ue be the solution to (1.50) under the hypotheses of Exercise 7. We take 
v > 0 fixed and let ¢ \, 0. Obtain bounds on {ue : € € (0,1]} which imply that 
a subsequence converges to a weak solution uo of the ¢ = 0 case of (1.50), thus 
providing another proof of Proposition 1.7. (Hint: Start with the following analogue of 
(1.39): 


sh T 
av 7 || Vue (t)|[22 dt + 2 | |A®ue(t)|[22 dt = |[fll22 — [Ive (T)II22-) 
0 0 


346 15. Nonlinear Parabolic Equations 


9. Let u be a smooth solution for 0 < t < T of the system (1.16), under the hypothesis 
(1.25) of Proposition 1.4, namely, 


(1.52) ou =vAu+ 5 d;Fi(u), Ou; F*; = Ou,F";, u(0) = f. 


Thus F*; = 0,,,,G;. Show that 
£ |Veu(t)|l2 = —2||Aullze 27 f (Oxy AueGj(u)) Arun Bus de 
dt x L L Upvui 7) i % 
1 2 
< ~20||Aullde + £(llu(t)|ln=)"Voulds + vlAulles, 


where 


(1.53) OM) = Gap lO Oug sku) = sap. [lu Hal) | 


jul<M jul< 


Integrating this and using the estimate on v & ||Vcu(r)||7.2 dr that follows from 
(1.26)-(1.27), deduce that, forO < t < T, 


t 
(1.54) ||Veu(t)||Z2 + vf |Au(r)|lZ2 dr < [|Veflli2 + v7 W(u, t)" || Fllz2, 
0 


where U(u, t) = 6(M), with M = sup{||u(r) |b : 0 < 7 < th. 

10. In the context of Exercise 9, suppose that the space dimension is n = 1. Note that in 
this case, ||u(t)||Z00 < Cl|Veu(t)||z2||u(O)||,2 + Cllu(t)||Z2. Show that under the 
hypothesis 

6(M)M~* —0, as M > +00, 


we have a bound on ||u(t)||;71 ast 7 T,, and hence a global existence result for (1.52). 
Compare with [Smo], p. 427. 

11. Solve the system 

Ut = Une +u(uz +z), Ve = Vex + 0(uz +2), 


(1.55) ; 
u(0,x) = Acoskx, v(0,2) = Asinka, 


with A > 1. Here, x € S$ = R/27Z. Show that the maximal t-interval of existence 
in R® is [0, C(A)/k?). This example is given in [LSU] and attributed to E. Heinz. 
12. Consider the multidimensional “Burger’s equation” 


(1.56) ut+Vuu=Au, u(0,x) = f(z), 
for u(t, x) : Rt x T” — R”. Show that, for each t > 0, 


sup |u;(t,x)|< sup |f;(@)|, 1<j<n. 
xwETr “xET™ 


Deduce that (1.56) has a global solution. (Hint: Show that 


d a 
qlee SCY DS |! uj)(D*ue)|l 2 Mell aetr — Vall zee, 
G6 |a|+|B1Sk 


and use Proposition 3.6 of Chap. 13 to estimate ||(D°u;)(D°ue)|| 22.) 
Note that the case n = 1 is also treated by Proposition 1.5. 
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2. Applications to harmonic maps 


Let M and N be compact Riemannian manifolds. Using Nash’s result, proved 
in §5 of Chap. 14, we take N to be isometrically imbedded in some Euclidean 
space; N C R*. A harmonic map u : M — N is a critical point for the energy 
functional 


(2.1) Blu) = 5 , [Vu(e) 2 aV (2), 
M 


among all such maps. In the integrand, we use the natural square norm on T* M ® 

Tu(2)N C TIM @ R*. The quantity (2.1) clearly depends only on the metrics on 

M and N, not on the choice of isometric imbedding of NV into Euclidean space. 
If us is a smooth family of maps from M to N, then 


(2.2) < a) = | v(e)Au(a) dV, 


where u = uo, and v(x) = (0/0s)us(a) € Ty) N. One can vary uo so that v is 
any map M -+ R¥ such that v(x) € T. u(x) N, so the stationary condition is that 


(2.3) Au(x) L Tyx)N, for all 2 € M. 


We can rewrite the stationary condition (2.3) by a process similar to that used in 
(11.12)-(11.14) in Chap. 1. Suppose that, near a point z € N C R*, N is given by 


(2.4) fey) =0, 1<f<1, 


where L = k — dim N, with V fe(y) linearly independent in R*, for each y 
near z. If u: MM — N is smooth and u(x) is close to z, then we have 


Ofe Ou é 

2. = 1<@<L,1<j< 
(2.5) Yue Bap 7% SASL, USGSm, 
where (21, ...,@m) is a local coordinate system on M. Hence 

0 , OF Ou, OUp 
(2.6) a je _O Fe Oty Ott 

Ou, Ou,OU, Ox, OX; 

v H,V,j,k 


Since {V,fe(y) : 1 < 0 < L} isa basis of the orthogonal complement in R* 
of T,,N, it follows that, for smooth u : M — N, the normal component of Au 
depends only on the first-order derivatives of u, and is quadratic in Vu; that is, 
we have a formula 


(2.7) (Au)% =T(u)(Vu, Vu). 
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Thus the stationary condition (2.3) for u is equivalent to 
(2.8) Au —T(u)(Vu, Vu) = 0. 


Denote the left side of (2.8) by r(u); it follows from (2.7) that, given 
u € C?(M, N), T(u) is tangent to N at u(z). 
J. Eells and J. Sampson [ES] proved the following result. 


Theorem 2.1. Suppose N has negative sectional curvature everywhere. Then, 
given v € C~(M,N), there exists a harmonic map w € C%(M,N) which 
is homotopic to v. 


As in [ES], the existence of w will be established via solving the PDE 


(2.9) au = Au—T(u)(Vu, Vu), (0) = v. 

It will be shown that under the hypothesis of negative sectional curvature on 
N, there is a smooth solution to (2.9) for all t > O and that, for a sequence 
ty, — 00, u(t,) tends to the desired w. In outline, our treatment follows that pre- 
sented in [J2], with some simplifications arising from taking NV to be imbedded in 
R* (as in [Str]), and also some simplifications in the use of parabolic theory. 

The local solvability of (2.9) follows directly from Proposition 1.2. Since r(u) 
is tangent to N for u € C™®(M, N), it follows that u(t) : M — N for each t in 
the interval [0, 7’) on which the solution to (2.9) exists. To get global existence for 
(2.9), it suffices to estimate ||u(t)||c1. 

In order to estimate Vu, we use a differential inequality for the energy density 


1 
(2.10) e(t, 2) = 5lVeult, x)’. 
In fact, there is the identity 
1 J 
— —Ae= —|*V*ul? — 5 (du - Ric (e;), du - e;) 
+ (RN (du -e;, du-e,)du- ex, du- ej), 


where {e;} is an orthonormal frame at T,, 4 and we sum over repeated indices. 
The operator ’ V? is obtained from the second covariant derivative: 


NV?u(az) : @’?TyM —> Tua) N. 


See the exercises for a derivation of (2.11). 
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Given that NV has negative sectional curvature, (2.11) implies the inequality 


Oe 
2 eee 
(2.12) at Ae < ce. 


If f(t,v) = e~“e(t, x), we have Of /Ot — Af < 0, and the maximum principle 
yields f(t,2) <||f(0,-)llz~, hence 


(2.13) e(t, x) < e||Vul|z.00. 
This C!-estimate implies the global existence of a solution to (2.9), by Proposition 
Per 


For the rest of Theorem 2.1, we need further bounds on u, including an 
improvement of (2.13). For the total energy 


(2.14) E(t) = f ett,2) dV (x) = 5 | lvuP avin), 


M M 


we claim there is the identity 


(2.15) BGy= -{ |u|? dV (2). 
M 


Indeed, one easily obtains E’(t) = — f (uz, Au) dV(x). Then replace Au by 
uz +T(u)(Vu, Vu). Since u; is tangent to N and T'\(u)(Vu, Vu) is normal to N, 
(2.15) follows. The desired improvement of (2.13) will be a consequence of the 
following estimate: 


Lemma 2.2. Let e(t, x) > 0 satisfy the differential inequality (2.12). Assume that 


E(t) = feta) dV (x) < Ep 
is bounded. Then there is a uniform estimate 
(2.16) e(t,z)<e°K Ey, t>1, 
where Ix depends only on the geometry of M. 


Proof. Writing 0e/0t — Ae = ce — g, g(t, x) > 0, we have, for0 < s <1, 


(2.17) e(t + 8,0) =e AMe(t, x) — ; ele ES ge) dr 


< eA+t)e(t, 2). 
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Since e*(4+°) is uniformly bounded from L1(M) to L°(M) for s € [1/2, 1], the 
bound (2.16) for t € [1/2, 00) follows from the hypothesized L'-bound on e(t). 


We remark that a more elaborate argument, which can be found on pp. 84-86 
of [J2], yields an explicit bound K depending on the injectivity radius of M and 
the first (nonzero) eigenvalue of the Laplace operator on VM. 

Since Lemma 2.2 applies to e(t, 7) = |Vu|? when wu solves (2.9), we see that 
solutions to (2.9) satisfy 


(2.18) l|u(t)||o1r < Kylul|or, for allt > 0. 

Hence, by the regularity estimate in Proposition 1.2, there are uniform bounds 
(2.19) llu(t)llce < Kellullos, t 21, 

for each £ < oo. Of course there are consequently also uniform Sobolev bounds. 

Now, by (2.15), E(t) is positive and monotone decreasing as t 7 oo. Thus 
the quantity [,,, |ue(t, x)|? dV(«) is an integrable function of ¢, so there exists a 
sequence t; — oo such that 
(2.20) ||we(t;,-) ||b2 > 0. 

From (2.19) and the PDE (2.9), we have bounds 
I|ue(t, -)|lare < Cr, 
and interpolation with (2.20) then gives, for any @ € ZT, 
(2.21) ||we(tj,-) || ere 4 O. 
Therefore, by the PDE (2.9), one has for u;(x) = u(t;, 2), 
(2.22) Au; —T(u;)(Vu;,Vu;) 30 in H*(M), 
as well as a uniform bound from (2.19). It easily follows that a subsequence 
converges in a strong norm to an element w € C®(M,N) solving (2.8) and 
homotopic to v, which completes the proof of Theorem 2.1. 

We next show that there is an energy-minimizing harmonic map w : M — N 
within each homotopy class when N has negative sectional curvature. 
Proposition 2.3. Under the hypotheses of Theorem 2.1, if we are given v € 
C™(M,N), then there is a smooth map w : M — N that is harmonic, and 


homotopic to v, and such that E(w) < E(®) for any 0 € C®(M,N) homo- 
topic to v. 
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Proof. If a is the infimum of the energies of smooth maps homotopic to v, pick 
vy, homotopic to v, such that E(v,) \, a. Then solve (2.9), for u,, with initial 
data u,(0) = v,. We have some sequence u,(t,;) + wy € C%(M, N), har- 
monic. The proof of Theorem 2.1 gives E(w,) < E(v,), hence E(wy) > a. 
Also, via (2.16) and (2.19), we have uniform C’-bounds on w,, for all @. Thus 
{w_} has a limit point w with the desired properties. 


We record a local existence result for parabolic equations with a structure like 
that of (2.9), with initial data less smooth than C!. Thus we look at equations of 
the form (1.1), with 


(2.23) F(a, Diu) = B(u)(Vu, Vu), 
a quadratic form in Vu. In this case, we take 


(2.24) X=M?, Y=1%, q= a p>n, 


and verify the conditions (1.3)—(1.6), using the Sobolev imbedding result 
HePc praise), p< nie 
8 
This yields the following: 


Proposition 2.4. If (2.23) is a quadratic form in Vu, then the PDE 


(2.25) au = Au+ B(u)(Vu, Vu), u(0) = f, 


has a solution in C({0,T], H+?) A C%((0,T) x M), provided f € H'?(M), 
pn. 


The smoothness is established by the same sort of arguments as described 
before. Of course, the proof of Proposition 2.4 yields persistence of solutions as 
long as ||u(t)|| 71.» is bounded for some p > n. 

We mention further results on harmonic maps. First, in the setting of 
Theorem 2.1, that is, when N has negative sectional curvature, any harmonic 
map is energy minimizing in its homotopy class, a fact that makes Proposition 2.3 
superfluous. An elegant proof of this fact can be found in [Sch]. It is followed by a 
proof of a uniqueness result of P. Hartman, which says that under the hypotheses 
of Theorem 2.1, any two homotopic harmonic maps coincide, unless both have 
rank < 1. 

Theorem 2.1 does not extend to arbitrary N. For example, it was established 
by Eells and Wood that if v € C®°(T?, S”) has degree 1, then v is not homotopic 
to a harmonic map. Among positive results not contained in Theorem 2.1, we 
mention a result of Lemaire and Sacks—Uhlenbeck that if 72(NV) = 0 and dim 


352 15. Nonlinear Parabolic Equations 


M = 2, then any v € C™~(M, N) is homotopic to a smooth harmonic map. If 
dim M > 3, there are nonsmooth harmonic maps, and there has been considerable 
work on the nature of possible singularities. Details on matters mentioned in this 
paragraph, and further references, can be found in [Hild, J1, Str, Str2]. We also 
refer to [Ham] for extensions of Theorem 2.1 to cases where / and N have 
boundary. 

In case M and N are compact Riemann surfaces of genus > 2 (endowed 
with metrics of negative curvature, as done in §2 of Chap. 14), harmonic maps 
of degree 1 are unique and are diffeomorphisms, as shown by R. Schoen and 
S.-T. Yau. They measure well the degree to which M/ and N may fail to be con- 
formally equivalent, and they provide an excellent analytical tool for the study of 
Teichmuller theory, replacing the more classical use of “quasi-conformal maps.” 
This material is treated in [Tro]. 

We mention some other important geometrical results attacked via parabolic 
equations. R. Hamilton [Ham2] obtained topological information on 3-manifolds 
with positive Ricci curvature and in [Ham3] provided another approach to the 
uniformization theorem for surfaces, an approach that works for the sphere as well 
as for surfaces of higher genus; see also [Chow]. S. Donaldson [Don] constructed 
Hermitian—Einstein metrics on stable bundles over compact algebraic surfaces; 
see [Siu] for an exposition. Some facets of the Yamabe problem were treated via 
the “Yamabe flow” in [Ye]. 

Hamilton’s Ricci flow equation 


is a degenerate parabolic equation, but D. DeTurk [DeT] produced a strongly 
parabolic modification, which fits into the framework of § 7 of this chapter, giv- 
ing short time solutions. Solutions typically develop singularities, and there has 
been a lot of work on their behavior. Work of G. Perelman, [Perl ]—[Per3], was 
a tremendous breakthrough, greatly refining understanding of the Ricci flow and 
using this to prove the Poincaré Conjecture and Thurston’s Geometrization Con- 
jecture, for compact 3-dimensional manifolds. This work has generated a large 
additional body of work, quite a bit of it devoted to giving more digestible pre- 
sentations of Perelman’s work. We refer to [CZ] and [MT] for such presentations, 
and other references. 


Exercises 


For Exercises 1-3, choose local coordinates x near a point p € M and local coordi- 
nates y near g = u(p) € N. Then the energy density is given by 


1 Our Ou ke 
2.2 t,t) ==> hela 
(2.26) elt, a) = 5 Sut Set gM (a) hyw(ult,2)), 
where u(x) = (wi(x),-+- ,Un(a)) in the y-coordinate system, n = dim N. Here, 


gre and h,,, define the metrics on VM and N, respectively, and we use the summation 
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convention, here and below. Assume the coordinate systems are normal at p and q, 
respectively. 
1. Using these coordinate systems, show that the PDE (2.9) takes the form 


Ouv ke Ou, ke Myj  OUv 
2.27 = I? pe 
ee) Ot g Ox,OXe g nS Ox; Te Oa, Oxo’ 


where “I? and NT” » are the connection coefficients of M and N, respectively. 
2. Differentiating (2.27), show that, at p, 


+ _ j OP 9K; 8" grr Ouv 
Ot Ox¢ Ox,0xX,OX¢ 21OxrpOx0  Ox,Oxe Ox;Oxe | Ox; 
1 O° Ay, O° hav O*hro Oug Our Oua 


2 |Buadue | OyrOug  Ouvdya| Ore Ox Orn 


O Ow OPur 1 0? 9k; 
(2.28) 


3. Using (2.28), show that, at p, 


2 2 
Ov uv Oru 


Oe hee: 
Ot Ox ;0X% Ox;OLk 
(2.29) 1 87 ger 0? 51 8 ge: 0" gie (Fe a) 
2|O0xvi0x;, OxeOr, OxrOxrK, OxXiOTE Oxe OXk 
tl Oh, he OP? hy r OP pv Oup Our Quy Oup 
‘a ee  OypOy — OypOyw cae | (Se Ox; OxK se). 


Obtain the identity (2.11) by showing that this is equal, at p, to 


1 mM OUp Ouy lin OUp Ou Ou, Ou, 


Nwo2, 2 : 
Vul? + = Rick ‘ 
[Vall +5 Ritsk oo aa, ~ 2 Ge, Oxy Dan Btn 


To define “ V?u(x), let F + M denote the pull-back u*T'N, with its pulled-back 
connection V. To Du : TM —,TN we associate Du € C®(M,T* @ E). If V* 
denotes the product connection on T* M ® E, we have 


(2.30) NV*?u = V*Du € C™(M,T* @T* ® E). 


Compare the construction of second covariant derivatives in Chap.2, §3, and 
Appendix C, § 2. 

If N C R*, let F — M be the pull-back u7*R*, with its pulled-back (flat) con- 
nection V°. We have Du € C®(M,T* ® E) C C®(M,T* ® F) and 


(2.31) Vu = V°Du € C*(M,T* @T* @ F), 
obtained by taking the Hessian of u componentwise. 
4. Show that 
(2.32) NV?u(X,Y) = PaV*ulX,Y), 


where Pz : F — E is orthogonal projection on each fiber. Parallel to (2.7), produce a 
formula 


(2.33) Veu = *V?u+G(u)(Vu, Vu), orthogonal decomposition. 


354 15. Nonlinear Parabolic Equations 


Relate G'(w) to the second fundamental form of N C R*; show that 
(2.34) G(u)(Vu, Vu)(X, ¥) = 1% (Du(x)X, Du(2)Y). 


5. Suppose N is a hypersurface of R*, given by N = {x € R* : v(x) = C}, with 
Vy #0 on N. Show that [(u)(Vu, Vu) in (2.7) is given in this case by 


(2.35) T(u)(Vu, Vu) = — 2 O;u(x) - D?p(u(z)) - O;u(x) 


Compare with the geodesic equation (11.14) in Chap. 1. 

6. Ifdim M = 2, show that the energy E(w) given by (2.1) of asmooth map u : M — N 
is invariant under a conformal change in the metric of M, that is, under replacing the 
metric tensor g on M by g’ = e*“g, for some real-valued f € C°(M). 

7. Show that any isometry w : M — N of M onto N is a harmonic map. 

8. Show that if dim M = dim N = 2andw: M — N is aconformal diffeomorphism, 
then it is harmonic. (Hint: Recall Exercise 6.) =e 

9. Ifu: M — N is an isometry of M onto a submanifold M C N that is a minimal 
submanifold, show that wu is harmonic. 

10. If dim M = 2 and f : M — N, show that 


E(f) > Area(f(M)), 


with equality if and only if f is conformal. 


3. Semilinear equations on regions with boundary 
The initial-value problem 


(3.1) au = Au+ F(t,z,u,Vu), u(0) =f, 


for u = u(t, x), was studied in § 1 for x € M, a compact manifold without bound- 
ary. Here we extend many of these results to the case where x € M, a compact 


manifold with boundary. As in § 1, we assume F' is smooth in its arguments. We 
will deal specifically with the Dirichlet problem: 


(3.2) u=OonRt x 0M. 


There is an analogous development for other boundary conditions, such as 
Neumann or Robin boundary conditions. 

Recall that Propositions 1.1 and 1.1A were phrased on a very general level, so 
a number of short-time existence results in this case follow simply by verifying 
the hypotheses (1.3)—(1.6), for appropriate Banach spaces X and Y on M. For 
example, somewhat parallel to (1.10), consider X = Cj(M), Y = C(M), 
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where, for 7 > 0, we set 
(3.3) C}(M) = {f € C7(M) : f =0 on OM}. 


In Proposition 7.4 of Chap. 13, it is shown that e’4 is a strongly continuous semi- 
group on CO} (/). Also, (7.52) of Chap. 13 gives 


(3.4) lle flag < Ct |Iflln~, ford<t<1, 
so we have the following: 


Proposition 3.1. If f € Ci (M), then (3.1)(3.2) has a unique solution 


(3.5) u € C([0,T),C'(M)), 
for some T > 0, estimable from below in terms of || f ||c1. 


If we specialize to F' independent of Vu, hence look at 


(3.6) a =Lu+F(t,z,u), u(0)=f, 


we can take X = C;,(M), Y = C(M), and, by arguments similar to those used 
above, we obtain the following result: 


Proposition 3.2. If f € C,(M), then (3.6), (3.2) has a unique solution 


(3.7) u€ C((0,T),C(M)). 
for some T > 0, estimable from below in terms of || f \|z~. 


We can obtain further regularity results on solutions to (3.1) and (3.6), with 
boundary condition (3.2), making use of regularity results for 


(3.8) au = Aut g(t,z), u(t,v) =0 for ce 0M, 


established in Exercises 4-10 of Chap. 6, § 1. To recall the result, let us set, for 
keZt, 


(3.9) H*(IxM) = {u € L?(IxM) : du € L?(I, H?*-75(M)),0 <j < k}. 
The result is that if (3.8) holds on J x M, with I = [0, To], then 


(3.10) ge Hu (Ix M) > ueH* (1 x M), 
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for I’ = [e, To], ¢ > 0. Taking g = F(t,x,u, Vu) for u in Proposition 3.1 and 
g = F(t, x,u) for u in Proposition 3.2, we have in both cases g € H°(I x M), 
whenever Ty) < T’, and hence 


(3.11) uE€H'(I' x M), 


in both cases. One also has higher order regularity. For simplicity, we restrict 
attention to the setting of Proposition 3.2. 


Proposition 3.3. Assume F is smooth in its arguments. The solution (3.7) of (3.6), 
(3.2) has the property 


(3.12) ue C~((0,T) x M). 
Proof. In this case, we start with the implication 
(3.13) ueéC(Ix M)NH'(I' x M) = Fi(t,2,u) € H'(I' x M), 


as follows from the chain rule and Moser estimates, as in Proposition 3.9 in 
Chap. 13. Applying (3.10) then gives u € H?(I’ x M). More generally, 


(3.14) ue C(Ix M)NH*(I' x M) = Fi(t,2,u) € H*(I' x M). 
Repeated applications of this plus (3.10) then yield u € H**1(I’ x M) for all k, 
which implies (3.12). 


Exercises 


1. Work out results parallel to those presented in this section, when the Dirichlet boundary 
condition (3.2) is replaced by Neumann or Robin boundary conditions. 
2. Consider the 3-D Burger equation 


(3.15) u+Vuu= Au, u(0,2)= f(x), u(t,c) =0, fore € OQ, 


where u : R* x Q — R?, and Q is a bounded domain in R? with smooth boundary. 
Show that the set-up to prove local existence works, with 


X =H3(Q), Y =L**(Q). 
(Hint: Show that Hd (Q) - L?(Q) c L°/?(Q) and D(A’*) Cc L3(Q), hence 


lle“ Fila) < Ct If llzs/2@qy, ford <t <1.) 


4. Reaction-diffusion equations 


Here we study @ x & systems of the form 


4. Reaction-diffusion equations 357 


Ou 
(4.1) a Iu+X(u), u(0)=f, 


where u = u(t, x) takes values in R’, X is a real vector field on R‘, and L is a 
second-order differential operator, which we assume to be a negative-semidefinite, 
self-adjoint operator on L?(M). We take M to be a complete Riemannian man- 
ifold, of dimension n, often either R” or compact. The numbers n and & are 
unrelated. We do not assume L is elliptic, though that possibility is not precluded. 

Such a system arises when “substances” S,, 1 < v < £, whose concentrations 
are measured by u,,, are simultaneously diffusing and interacting via a mechanism 
that changes these quantities. Recall from the introduction to Chap. 11 the relation 
between the quantity wu, of S, and its flux J,, in case S,, is being neither created 
nor destroyed. This generalizes to the identity 


5 f wilt) ava) = = ee +m u(t, z)) dV(2) 


oO 


if X,,(u) is a measure of the rate at which S,, is created, due to interactions with 
the “environment,” namely, with the other S,,. Consequently, by the divergence 
theorem, 


Ouy 
Ot 


If we assume that each S,, obeys a diffusion law independent of the other sub- 
stances, of the form considered in Chap. 11, that is, 


= —div J, + X,(u). 


J, = —d, grad up, 


then we obtain the system (4.1), with L = DA, where D is a diagonal ¢ x ¢ matrix 
with diagonal entries d,, > 0; we allow the possibility d, = 0, which means S,, 
is not diffusing. 

An example of the sort of system that arises this way is the Fitzhugh-Nagumo 
system: 


Ov OP v 

= + f(v) —w, 
Oe = ev — ww) 
DE E(u — yw), 


with 


In this case, 


(4.3) b= (Fs i 
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Here D is a positive constant. This arose as a model for activity along the axon of 
a nerve, with v and w related to the voltage and the ion concentration, respectively. 
We will mention other examples later in this section. While we will mention what 
various examples model, we will not go into the mechanisms behind the models. 
Excellent discussions of all these models and more can be found in [Mur]. 

One property L in (4.3) has is the following generalization of the maximum 
principle: 


Invariance property. There is a compact, convex neighborhood K of the origin 
in R¢ such that if f € L?(M), then, for allt > 0, 


(4.4) f(x) €K for all « — e'” f(x) € K for all x. 
Thus, if f,g € L?(M) have compact support, 
(4.5) len flue < all fllo~, 


with « independent of t > 0. If we defined a norm on R* so that K M (—K) was 
the unit ball, would have « = 1. Note that, for such f and g, we have 


(4.6) (ef 9) = M(fe9)| < aMlfllcallgiic~, 


so lle” f |p. < k|| f||,1. Thus e’” has a unique extension to a linear map 
(4.7) e : LP(M) — L?(M), le“ || < yp, 


in case p = 1, hence, by interpolation, for 1 < p < 2, and, by duality, for 
2 < p < o, uniqueness for p = oo holding in the class of operators whose 
adjoints preserve L'(M). 

As mentioned above, in many examples of reaction-diffusion equations, 
L = DL, where D is a diagonal € x £ matrix, with constant entries d; > 0, 
and Lo is a scalar operator, generating a diffusion semigroup on L?(/); in fact, 
often M = R and Lo = 07/0x?. For such L, any rectangular region of the form 
K = {y € R°: a; < y; < bj} has the invariance property (4.4). If some of the 
diagonal entries d; coincide, there will be a somewhat larger set of such invariant 
regions. 

We apply the technique of § 1 to obtain solutions to (4.1), rewritten as the 
integral equation 


(4.8) u(t) = a! f + i: el 94 X (u(s)) ds. 
0 


Proposition 4.1. Let V be a Banach space of functions on M with values in R° 
such that 


(4.9) e'” : V —+ V is a strongly continuous semigroup, for t > 0, 
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and 
(4.10) X:V — V is Lipschitz, uniformly on bounded sets, 
where (Xf)(a) = X(f(ax)). Then (4.8) has a unique solution 
Wwe C((0,7],V), 
where T > 0 is estimable from below in terms of |\|f |v. 

The proof is simply a specialization of that used for Proposition 1.1. Note that 
(4.10) holds for a variety of spaces, such as V = L?(M,R*‘), V = C(M,R‘), 
when X is a vector field on R* satisfying 
(4.11) IX@I <i, IIVX@)I| SC, Vy eR’, 
provided M is compact. If M has infinite volume, you also need X(0) = 0 for 
V = L?(M,R°*) to work. Whenever X has this property, and L satisfies the 
invariance property (4.4), it follows that Proposition 4.1 applies, for initial data 
f € L?(M,R‘), 1 < p < oo. If, in addition, e*” : C(M) — C(M), we also 


have short-time solutions to (4.8) for f € C(M,R°). For example, if M = R” 
and L has constant coefficients, then (4.7) implies 


(4.12) ef! . H*?(R") — H*?(R"), k>O0. 
Also 
(4.13) a” rC(R") 4 CR), 


for t > 0, since C,(R”), the space of continuous functions vanishing at infinity, 
is the closure of H*:?(IR”) in L°(R"), for k > n/2. 
Another useful example when J = R” is the space 


(4.14) BC(R”) ={f € C(R”) : f extends continuously to R}, 


where R” is the compactification of R” via the sphere at infinity (approached 
radially). For k € Z*, we say f € BC*(IR”) provided D* f € BC(IR") whenever 
la] <k. 

If M@ = R” and (4.12) holds, then Proposition 4.1 applies with V = 
H*-?(IR”,IR°) whenever the vector field X and all its derivatives of order <k 
are bounded on R* (and X(0) = 0). It also applies to BC(R”,R°). Now if L 
is not elliptic, we have no extension of the regularity result in Proposition 1.2. 
By a different technique we can show that under certain circumstances, if f 
belongs to a space like H*?(R”,R*), then a solution u(t) persists as a solution 
in C((0, 7], H"?(R”)) as long as it persists as a solution in C((0, 7], C,(R”)). 
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To get this, we reexamine the iterative formula used to solve (4.8), namely 


t 
(4.15) uji() = es + f e-92X (u;(s)) ds. 
0 


As long as (4.9) holds for the Banach space V, we have 


t 
luz lly < lle flv + c| eX) LX (uj(s))|lv ds 


< A(t) +Cte** sup ||X(uj(s))Ilv 
O<s<t 


(4.16) 
and 
G17) [jujsr()—wsOlly S Cte sup |X (uj(s)) — X(uja(9))lv- 


Now, as shown in Chap. 13, § 3, for such spaces as V = H*:?(IR”), there are 
Moser estimates, of the form 


(4.18) |uelly < Cllullr~|lully + Cllullv lel 

and 

(4.19) [E(w)llv <C(lullz~)(2 + llullv), C@)= sup |F (2). 
|2|<A,|u|<k 


In particular, ||_X (u)||v satisfies an estimate of the form (4.19). Also, we can write 
1 
X(u) — X(v) =Y(u,v)(u-—v), Y(u,v) = | DX (out (1—o)v) do 
) 


and obtain the estimate 
|X(u) — X(v)|lv 
(4.20) < C(lullze + llullze) lu — ullv 
+ C(|lullz~ + llullze) (elly + lolly) lu — ollze- 


From (4.16) we deduce 
(4.21) |luj+i(é)llv < A(t) + te* oe C([luj(s)llz) (1 + llug(s)Ilv). 
If {u;(t) : 7 € Z*} is bounded in L~(M) for 0 < t < T, this takes the form 


(4.22) lujri(H|lv < Bi+ Bt sup |lu;(s)|lv, 
O<s<t 
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for 0 < t < T. Also, in such a case, (4.17) and (4.20) yield 


I|uj+1() — u;s(t)|lv < Bt a I|u3(s) — uz-1(s)Ilv 
(4.23) —— 
+ Bi sa (|u5(s) [lv + l]uj—1(s)|lv) lus (s) — uj—1(s)|[ Le. 


Now, in (4.22) and (4.23), B may depend on the choice of the space V, but it does 
not depend on the V-norm of any u;(s), only on the L°°-norm. 

Let us assume that uo(t) = e!” f satisfies ||uo(t)\|y < B,, for0 < t < 
T, B, > B. This is the B, used in (4.22). Assume Tp < 1/4B, To < 1/16BB), 
and Ty < T. Then ||u;(t)||v < 2B, for 0 < t < To, for all j € Z*, so {uj : 7 € 
Z*} is bounded in C([0, Zp], V). In such a case, (4.23) yields, for 0 < t < To, 


1 
lluj+i(é) — us()Ilv SZ sup lus(s) — uj-1(s)llv 
O<s<t 
(4.24) 


1 
+7 sup_|luj(s) — uj-1(s)|Iz~, 
O<s<t 


so {uj : 7 € Z*} is in fact Cauchy in C([0, 7], V), having therefore a limit 
u € C([0, To], V) satisfying (4.8). The size of the interval [0,7] on which this 
argument works depends on the choice of V and the size of ||w(0)||,-0, but not 
on the size of ||«(0) ||. We can iterate this argument on intervals of length To as 
long as ||u(t)||z-0 is bounded, thus establishing the following. 


Proposition 4.2. Suppose V is a Banach space of functions such that (4.9)-(4.10) 
and the Moser estimates (4.18)-(4.19) hold. Let f € VM L*©(M), and suppose 
(4.8) has a solution u € L®([0,T) x M). Then, in fact, u € C([0,T),V). If 
V = H*?(M), with k > 2, we thus have 


(4.25) u€ C((0,T), H*?(M)) NC1([0,T), H*-?(M)), 
solving (4.1). 


Global existence results can be established for (4.1) when f takes values in a 
bounded subset of R‘ shown to be invariant under the nonlinear solution operator 
to (4.1). An example of this is the following: 


Proposition 4.3. In (4.1), assume Lu = DAu, where D is a diagonal ¢ x € 
matrix with diagonal entries d; > 0 and A acts on u componentwise, as the 
Laplace operator on a Riemannian manifold M. Assume M is compact and f € 
H*?(M,R*), k > 2+n/p. Or assume M = R”, with its Euclidean metric and 
f € BC?(R",R°). Consider a rectangle R C R*, of the form 


(4.26) R= {ye R’: a; < y; < dj}. 
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Suppose that, for each y € OR, 


(4.27) X(y)-N <0, 


fe) 
where N is any outer normal to R. If f takes values in the interior R of R, then 


the solution to (4.1) exists and takes values in R for allt > 0. 


Proof. First suppose / is compact. If there is an exit from R, we can pick 
(to, Zo) such that 


(4.28) Uj (to, Xo) = a; Or b;, 


for some j = 1,...,@, and u(t,z) € R for all t < to, e € M. Pick bj, for 
example. Then 


(4.29) Opt; (to, Xo) > 0. 
Now y() = u,(to, 2) must have a maximum at x = 2, so 
(4.30) Opus (to, Xo) = d; Au; + X;(u) < X; (u). 


However, (4.27) implies Xj (u(to,%0)) <0, so (4.29) and (4.30) contradict each 
other. 

In case M = R", the existence of such (to,79) € R* x R" is problematic, 
though we can find such (to,29) € R*™ x R”, since u has a unique continuous 
extension to R+ x R” and R” is compact. We still have O,u(to, 20) > 0, and Au 
is continuous on Rt+ x R, but it is not obvious in this case that Au(to, 70) < 0, 
unless x lies in R”, not at infinity. Thus we argue as follows. 

Let BC (R”) denote {f € BC?(R") : D°f = Oatoo, for |a| = 2}. This 
Banach space is also one for which Propositions 4.1 and 4.2 work. Furthermore, 
the argument above regarding u(to, 279) does work if we replace f € BC?(R”, R°) 
by fp € BC *(R”, R*). Additionally, we can take a sequence of such f,, so that 
fv — f in BC(R”,R®), and obtain solutions u, such that u,(t,7) > u(t, x) 
uniformly on [0,7] x R” for any T < co. We can replace R by a slightly smaller 
rectangle R,, for which (4.27) holds, and arrange that each f, takes values in 1. 


Then u(t, x) always takes values in R, C R. This completes the proof in the case 
M =R”. 


As an example of Proposition 4.3, we consider the Fitzhugh-Nagumo system 
(4.2), in which the vector field _X on R? is 


(4.31) X(v,w) = (f(v) —w,e(v—yw)),  f(v) = v(1 — v)(v—a). 
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In Fig. 4.1 we illustrate an invariant rectangle R that arises from the choices 
(4.32) y=20, a=0.4, ¢=0.01. 


This invariant region contains three critical points of X, two sinks and a saddle. 
For this construction to work, we need the following: 


The top edge of R lies above the line w = v/7, 

while the bottom edge of 7e lies below this line; 

the left edge of FR lies to the left of the curve w = f(v), 
while the right edge of 7 lies to the right of this curve. 


The two curves mentioned here are the “isoclines,” defining where X2 = 0 and 
X 1 = 0, respectively. The condition just stated implies that X points down on 
the top edge of R, up on the bottom edge, to the right on the left edge, and to 
the left on the right edge. In Fig. 4.1 we also depict a smaller invariant rectangle 
Ro, which contains only one critical point of X, the sink at (0,0). Figure 4.2 is 
a similar illustration, with ~ changed from 20 to 10; in this case X has only one 
critical point. 

The vector field (4.31) does not actually satisfy the hypothesis (4.11), since the 
coefficients blow up at infinity. But one can alter X outside R to produce a vector 
field X to which Propositions 4.1 and 4.2 apply. As long as the initial function 
u(0) = f takes values inside R, one has a solution to (4.2). 

While Proposition 4.3 is an elementary consequence of the maximum princi- 
ple, this result can also be seen to follow quite transparently from a “nonlinear 
Trotter product formula,” namely a solution to (4.1) satisfies 


(4.33) u(t) = lim (e/ os) "( 7), 


vt—F 00 


\, 


FIGURE 4.1 Invariant Rectangles for Fitzhugh-Nagumo System 
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WZ 


FIGURE 4.2 Invariant Rectangles with Different Parameters 


where if F& is the flow on R‘ generated by X, then 


(4.34) F* f(z) = F& (f(z). 


We will prove this in the next section. See Proposition 5.4 for a precise 
statement. We mention that if f € BC'(R",R‘), then (4.33) converges in 
C((0, 7], BC°(R",R°)). We can use this result to prove the following, which 
is somewhat stronger than Proposition 4.3. 


Proposition 4.4. Assume X € C?(R°) and u(0) = f € BC'(R",R®), and let L 
be a second-order differential operator with constant coefficients, such that e‘” is 
a contraction on BC°(R",R‘), for t > 0. Assume there is a family {K, : 0 < 
s < co} of compact subsets of R‘ such that each K, has the invariance property 
(4.4). Furthermore, assume that 


(4.35) FL(K,) C Keay, 8,t eR. 


If u(0) = f takes values in Ko, then (4.1) has a solution for allt € Rt, and 
u(t, xv) © Ky. 


Proof. This is a simple consequence of the product formula (4.33). 


In cases where L is diagonal and { K,} is a certain shrinking family of rectan- 
gles, this result was proved in [RaSm], by different means. An example to which 
their result applies arises when [Xo is the rectangle Ro in Fig. 4.1. Then Kr, is 
a family of rectangles shrinking to the origin as s —> oo, and one gets decay 
of any solution to (4.2) whose initial function u(0) = f takes values in such a 
rectangle Ky. Of course, if there were no diffusion (i.e., D = 0 in (4.2)), one 
would get such decay whenever u(0) = f took values in the region of R? for 
which the origin is an attractor. One then has the question of whether a sufficient 
degree of diffusion could change the situation. We will return to this point shortly. 
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For the system (4.2), there are families of rectangles K’, such that Ko contains 
an arbitrarily large disk centered at the origin, which contract to Ky = R, satis- 
fying (4.35). Hence any solution to (4.2) with initial data in BC : (IR, R?) exists for 
all t > 0, and, for t large, u(t) takes values in R. However, in the cases illustrated 
in both Figs. 4.1 and 4.2, there is not a family of rectangles having the property 
(4.35) taking R to Ro, and in fact not all solutions to (4.2) with initial data in 
BC" (IR, R?) will decay to a constant function. 

One class of nondecaying solutions to reaction-diffusion equations of particu- 
lar interest is the class of “traveling wave solutions,” which, in case M = R and 
L has constant coefficients, are sought in the form 


(4.36) u(t, 2) = p(x — ct). 


Suppose L = D0?, where D is a diagonal matrix, with entries d; > 0. Then y(s) 
must satisfy the second-order ¢ x @ system of ODEs: 


(4.37) De" + cy’ + X(v) = 0. 
Using ~ = vy’, we convert this to a first-order (22) x (22) system 
(4.38) y=, Dw’ = -cy— X(y). 


If some d; = 0, it is best not to use q;. 
Let us first take a closer look at the scalar case, which we write as 


Ov D O?u 


(4.39) AL amt g(v). 


Then a traveling wave v(t, z) = y(a — ct) arises when ¢(s) satisfies the single 
ODE 


(4.40) De" + cp’ + g(y) = 0. 
With 7) = y’, we have the 2 x 2 system 
(4.41) g=%, Wp =-cp—g(¥), 
taking D = 1 without essential loss of generality. This is amenable to a simple 
phase-plane analysis. 
The vector field Y = Y, whose orbits are specified by (4.41) has critical points 


at w = 0, g(y) = 0. For a general smooth g in (4.39)-(4.41), if = ais a zero of 
g, and if g'(a) = o, then the linearized ODE about the critical point (a, 0) of Y is 


(4.42) ED a=(° a) 
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FIGURE 4.3 Function with Three Zeros 


Note that 
(4.43) TrA=-c, detA=o. 
This establishes the following: 


Lemma 4.5. If g(a) = 0, the critical point (a, 0) of the vector field Y defined by 
(4.41) is 
a saddle if g'(a) <0, 
a sink if g'(a) > 0 andc > 0, 
a source if g'(a) > Oandc < 0. 


Of course, when c = 0, (4.41) is in Hamiltonian form, with energy function 


(4.44) E(y,b) = 57 +Gly), G(?) = f a) dy. 


In that case, the integral curves of Y are the level curves of E(y, w), and a non- 
degenerate critical point for Y is either a saddle or a center. For c = 0, v(t, x) = 
(p(x) is a stationary solution to the PDE (4.39). If c 4 0, we can switch signs of 
s if necessary and assume c > 0. Then (4.40) models motion on a line, in a force 
field, with damping, due to friction proportional to the velocity. On any orbit of 
(4.41) we have 


dE 


(4.45) mo 


= —cp(s)? <0. 
This implies that Y cannot have a nontrivial periodic orbit if c > 0. 

Let us consider a case where g has three distinct zeros, a1, a2, a3, as depicted 
in Fig. 4.3. In this case, Y has saddles at (a1, 0) and (a3, 0), and a sink at (a2, 0). 
Now the three points (a,;,0) are also critical points of the function E(y,~), 
defined by (4.44), and, depending on whether the critical values at (a1,0) and 
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ay 
(a) (b) 


FIGURE 4.4 Vector Fields with Centers 


(a) (b) 


FIGURE 4.5 Vector Fields with Spiral Sinks 


(a3,0) are equal or not, the level curves of E(y, ~) (orbits of the c = 0 case of 
(4.41)) are as depicted in Fig. 4.4. When we take small c > 0, the orbits of Y in 
the cases (a) and (b), respectively, are perturbed to those depicted in Fig. 4.5. In 
case (a), both saddles are connected to the sink, while in case (b) just one saddle 
is connected to the sink. 

In case (b), if we let c increase, eventually the phase plane has the same behav- 
ior as (a). There will consequently be a particular value c = cg where an orbit 
connects the saddle (a,,,0) to the saddle (a,,0), where a,, is the zero of g for 
which G(y) = J g(y) dy has the largest value. An orbit connecting two differ- 
ent saddles is called a “heteroclinic orbit.” (Note that in case (b), at c = O there 
is an orbit connecting the other saddle (a,,,0) to itself; such an orbit is called a 
“homoclinic orbit.”) In an obvious sense, a, is the endpoint (either a; or a3) of 
the “smaller” of the two “humps” of w = g(y) in Fig. 4.3, the size being measured 
by the area enclosed by the curve and the horizontal axis. 

Such an orbit of Y connecting (a,,,0) to (a,,0) then gives rise to a traveling 
wave solution u(t,z) = v(x — cot), which, for each t > 0, tends to a,, as 
x —+ —oo and to a, as x — +00. If a, is the remaining zero of g(v), then 
for each c > 0, there is a traveling wave u(t, x) = ¢(a — ct), which tends to a, 
as x — —oo and to a, as x —> +00; and if c > co, there is a traveling wave 
u(t, x) = p(x — ct), which tends to a,, as x — —oo and to a, as > +00. 

Such traveling waves yield a transport of quantities much faster than straight 
diffusion processes, described by 0u/Ot = DAu. Yet this speed is due not to 
any convective term in (4.1), but rather to the coupling with the nonlinear term 
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X(u). Such behavior, according to Murray [Mur], was “a major factor in starting 
the whole mathematical field of reaction-diffusion theory.” 

Note that in the limiting case of (4.2) where ¢ = 0, w = wg is independent of 
t, and we get a scalar equation of the form (4.39), with g(v) = f(v) — wo, if wo 
is also independent of x. Another widely studied example of (4.39) is 


(4.46) g(v) = v(1 — v). 


In this case the vector field Y has two critical points: a saddle and a sink. This case 
of (4.39) is called the Kolmogorov—Petrovskii—Piskunov equation. It is also called 
the Fisher equation, when studied as a model for the spread of an advanlabeleous 
gene in a population; see [Mur]. 

If (4.1) is a2 x 2 system with L = DA, then one gets a vector field on R* from 
(4.38), provided D is positive-definite. If d; > 0 but dj = 0, then, as noted above, 
one omits ¢/2 and obtains a 3 x 3 system. For example, for the Fitzhugh-Nagumo 
system (4.2), one obtains traveling waves u = (v,w) = (1,2), provided 
1, 2, and 7 satisfy the system 


£4 = Wi, 
; 1 
(4.47) v= — (crs + f(~i) - 2), 
,_ € 
Po = -<(¢1 om 2): 
This has the form 
(4.48) C =Z,(0), 


for € = (v1, V1, $2), where Z, is a vector field on R°. 

Various techniques have been brought to bear to analyze orbits of such a vec- 
tor field. An important role has been played by C. Conley’s theory of “isolating 
blocks”; see [Car, Con, Smo]. It has been shown that, for small positive ¢, there 
exist c such that Z, has a periodic orbit, yielding periodic traveling waves for 
(4.2). Also, for certain c = c(e), Z, has been shown to have a homoclinic orbit, 
with (0, 0,0) as limit point. Such homoclinic orbits have been found numerically, 
with the aid of computer graphics, in [Rab]. The traveling wave arising from such 
a homoclinic orbit is called a pulse. (It follows from (4.45) that such a pulse cannot 
arise for scalar equations of the form (4.39) if D > 0.) There is a phenomeno- 
logical interpretation when (4.2) is taken as a model of activity along the axon of 
a nerve. As seen above, a sufficiently small initial condition (vo (x), wo(x)) pro- 
duces a solution decaying to (0,0) at t + oo. This traveling wave then arises from 
a sufficiently large initial condition. One says a “threshold behavior” is involved. 

For a variant of the Fitzhugh-Nagumo system proposed by H. McKean, [Wan] 
has established the existence of “multiple impulse” traveling wave solutions. 
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An interesting question is the following: For given initial data, when can you 
say that the solution u(t) behaves for large ¢ like a traveling wave? For the 
Kolmogorov—Petrovskii—Piskunov equation, work has been done on this question 
in [KPP] and [McK]. For other work, see [AW1, AW2, Bram, Fi]. 

If M = R",n > 1, and L = DA, one can seek a solution to (4.1) in the form 
of a traveling plane wave, u(t, x) = y(a-w — ct), where w € R” is a unit vector. 
Again y(s) satisfies the ODE (4.37). In addition to plane waves, other interest- 
ing sorts arise in the multidimensional case, including “spiral waves” and “scroll 
waves.” We won’t go into these here; see [Grin] for an introductory account. 

Let us return to the evolution of small initial data f. Recall the argument that, 
for sup |f(x)| sufficiently small, a solution to the Fitzhugh-Nagumo system (4.2) 
decays uniformly to 0. For that argument, we used more than the fact that (0,0) 
is a sink for the vector field X in that case; we also used a family of contract- 
ing rectangles. It turns out that, for a general reaction-diffusion equation (4.1) for 
which X has a sink at p € R*, specifying that f(x) be uniformly close to p does 
not necessarily lead to a solution u(t) tending to p as t oo. One can have the 
phenomenon of “diffusion-driven instability,” or a “Turing instability,’ which we 
now describe. For simplicity, let us assume L = DA with D = diag(d;,..., de), 
where A is the Laplace operator (acting componentwise) on an ¢-tuple of func- 
tions on a compact manifold M. 

We first give examples of this instability when X is a linear vector field, 
X(u) = Mu, so that Lu + X(u) = (£+ M)u is a linear operator. If { f;} 
is an orthonormal basis of L?(M) consisting of eigenfunctions of A, satisfying 
Af; = —a% f;, then Lu + Mu satisfies 


(4.49) (L+ M)(yf;) = (-a5Dy+ My) fj, y ER’. 


Now, under the hypothesis that 0 € R° is a sink for X, we have that both of the 
£ x £ matrices —a;D and M have all their eigenvalues in the left half-plane. All 
there remains to the construction is the realization that if two matrices have this 
property but do not commute, then their sum need not have this property. Consider 
the following 2 x 2 case: 


1 b-1 a 
(4.50) p=( a a= (7, i) 


Assume 0 < b < 1+ a7, a > 0. Thus Tr M = b — (1 +a?) < 0, while det 
M =a? > 0,so M has spectrum in the left half-plane. As before, assume d > 0. 
With \ = as, consider 


b-1-A a? 
4.51 N=M-—-AD= F 
oy) 2 ( —b Pat 


Of course, Tr NV < 0 if \ > 0; meanwhile, 
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(4.52) det N = dd? + [a? + d(1 — b)|A +a? = p(d). 
The matrix N will fail to have spectrum in the left half-plane, for some A > 0, 


if p(A) is not always > 0 for \ > 0, hence if p(A) has a positive root. From the 
quadratic formula, the roots of p() are 


2 = 
(4.53) rL= == au b) a os [a? + d(1 b)] 


* _ Aad, 


Thus p() will have positive roots if and only if a? + d(1 — b) < 0 and [a? + 


d(1 — 6)] * Ss dad. Recalling the conditions on a and b to make Tr M < 0, we 
have the following requirements on the positive numbers ), a, d: 


(4.54) b—1<a?<(b-—1)d, 2(b+1)a’d < (b—1)?d? + 0%, 


which also requires b > 1,d > 1. For example, we could choose b = 2, a? = 2, 
and d = 20, yielding 


1 1 2 
(4.55) p=( a: u=(4, as 


Under these circumstances, M/ — \D will have a positive eigenvalue for 
(4.56) A € (r_,r+4) C (0,00), 


where r+ are given by (4.53). 

Consequently, if the Laplace operator A on M has an eigenvalue —a? whose 
negative is in the interval (4.56), arbitrarily small initial data of the form yf; will 
be magnified exponentially by the solution operator to u; = (ZL + M)u, provided 
y has a nonzero component in the positive eigenspace of MZ — a’ D, despite the 
fact that the origin is a stable equilibrium for the evolution if the diffusion term is 
omitted. 

An example of a nonlinear reaction-diffusion equation that exhibits this phe- 
nomenon is the “Brusselator,” 


°° = Av+v°w — (b+ 1)u +4, 
(4.57) an 
a dAw — vw + bv, 


governing a certain system of chemical reactions. We assume a, b, d > 0. The vec- 
tor field X (which incidentally has flow leaving invariant the quadrant v,w > 0) 
has a critical point at (a, b/a), and its linearization at this critical point is given 
by the matrix M in (4.50). Thus if up = (vo, wo) is a small perturbation of the 
constant state (a, b/a), if the estimates (4.54) hold, and if A has an eigenvalue 


4. Reaction-diffusion equations 371 


whose negative is in the range (4.56), then a vector multiple of the eigenfunction 
f; will be amplified by the evolution (4.57). Of course, once this acquires appre- 
ciable size, nonlinear effects take over. In some cases, a spatial pattern emerges, 
reflecting the behavior of the eigenfunction f; (a7). One then has the phenomenon 
of “pattern formation.” 

In light of the instability just mentioned, we see some limitations on using 
invariant rectangular regions to obtain estimates. Consider the following more 
general type of Fitzhugh-Nagumo system: 


O7u Ow 


(4.58) a = Page + f(v) +a(v,w), oe b(v, w), 


where f,a, and b are assumed to be smooth and satisfy 

(4.59) fa(v,w)| < A(lo] + |w| +1),  [b(v,w)| < B(lvo] + |w| +1), 
and 

(4.60) f(v) < Civ, forv > C2, f(v) > Civ, forv < —C, 


where A, B, C1, and C2 are positive constants. There need be no large invariant 
rectangles in such a case, as the example f(v) = (2A + 2B + 1)v shows. Never- 
theless, one will have global solutions to (4.58) with data in BC! (IR, R?). In fact, 
this is a special case of the following result. 

To state the result, we use the following family of rectangular solids. Let Q(s) 
be the cube in R° centered at 0, with volume (2s)*, and let §.;(s) be the face of 
this cube whose outward normal is +e;, where {e; :1< 7 < ¢} is the standard 
basis of R*. 


Proposition 4.6. Let L = DA with D = diag(d,,..., dg) in (4.1). Assume the 
components X, of X satisfy, for some Co € (0, 00), 


Xj (y) <+Cos for y € F+;(s), 


< 
4.61 7 
er £6) =—Chajorye 5 46s), 


for all s > Cy. Then (4.1) has a global solution, for any f € BC'(R",R°). 


Proof. We will obtain this as a consequence of (4.33). Use the norm ||y|| = 
max, |y;| on R* to construct the norm on function spaces. The hypothesis implies 
(4.62) Fxull <e" llyl, #20 


> 


whenever ||y|| > C2. Consequently, 


ay [(eomneony a sell +0), 
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so by (4.33) we have the bound on the solution to (4.1): 
(4.64) lu(@)llz~= < e*([Ifllax + C2). 


Note that this is an application of Proposition 4.4, in a case where {K,} is an 
increasing family of rectangular solids. Other proofs of global existence for (4.58) 
under the hypotheses (4.59)-(4.60) are given in [Rau] and [Rot]. 

There are some widely studied reaction-diffusion equations to which Propo- 
sition 4.6 does not apply, but for which global existence can nevertheless be 
established. For example, the following models the progress of an epidemic, 
where v is the density of individuals susceptible to a disease and w is the den- 
sity of infective individuals: 


(4.65) — =—TvU, ae = DAw+rvw — aw. 

Assume r, a, D > 0. In this model, only the sick individuals wander about. Let’s 
suppose A is the Laplace operator on a compact two-dimensional manifold (e.g., 
the surface of a planet). One can see that the domain u,v > 0 is invariant; initial 
data for (4.65) should take values in this domain. We might consider squares of 
side s, whose bottom and left sides lie on the axes, but the analogue of (4.61) fails 
for X2 = rvw — aw, though of course X, < 0 is fine. To get a good estimate on 
a short-time solution to (4.65), taking values in the first quadrant in R?, note that 


(4.66) J (v + w) = DAw — aw. 


Integrating gives 


(4.67) 5 fw+ujav=-afwav co. 


M M 


By positivity, [(v + w) dV = |lv(t)||z1 + ||w(t)||z:, which is monotonically 
decreasing; hence both ||v(t)||,1 and ||w(t)||,1 are uniformly bounded. Of course, 
we have already noted that ||v(t)||L- < ||vol|ze. Thus, inserting these bounds 
into the second equation in (4.65), we have 


(4.68) oe = DAw + g(t, 2), 
Ot 
where 
(4.69) lg@llzrazy S rile@)|l2~|/wO|lz2 + @llw@|ln. < C. 


Now use of 
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t 
(4.70) w(t) = e Pwo +f e—8)DAq(s) ds 
0 


plus the estimate 
(4.71) lle Ilcctt,ze) < Cs OM) 


when dim M = 2, yields an L?-estimate on w(t), and another application of 
(4.70) then yields an L°°-estimate on w(t), hence global existence. 

Actually, for a complete argument, we should replace vw in the two parts of 
(4.65) by 3, (uw), where 3,,(s) = s for |s| < v,and 6,(s) =v+1fors >v+1, 
get global solvability for such PDE, with L°°-estimates, and take v — oo. We 
leave the details to the reader. 

The exercises below contain some other examples of global existence results. 
In [Rot] there are treatments of global existence for a number of interesting 
reaction-diffusion equations, via methods that vary from case to case. 


Exercises 

1. Establish the following analogue of Proposition 4.6: 
Proposition 4.6A. Let L = DA with D = diag(d1,...,de) in (4.1). Assume that 
the set tt = {y € R* : each yj > O} is invariant under the flow generated by X. 


If §+5(s) is as in Proposition 4.6, set a (s) = €* MN F4;(s), and assume that each 
component X; of X satisfies 


X;(y) < Cos, fory € 8; (s), 


for all s > Cy. Then (4.1) has a global solution, for any f € BC*(R",R*) taking 
values in the set €*. 


2. The following is called the Belousov—Zhabotinski system. It models certain chemical 
reactions, exhibiting remarkable properties: 


on = Av+v(l-—v—-—rw)+Lru, 
(4.72) ie 
w= Aw buw — Mw. 
Ot 


Assume r, L,b, M > 0. Show that the vector field X has flow that leaves invariant the 
quadrant {v,w > 0}. Show that Proposition 4.6A applies to yield a global solvability 
result. 
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3. The following system models a predator-prey interaction: 


2 DAv+v(1—v—w), 
(4.73) bi 
OE = Aw + aw(v — b), 


where v is the density of prey, w the density of predators. Assume D > 0,a > 0, and 
0 < b < 1. Show that the vector field X has a flow that leaves invariant the quadrant 
{v,w > 0} = €*. Show that Proposition 4.6A does not apply to this system. 
Demonstrate global existence of solutions to this system, for initial data taking values 
in the set €*. (Hint: Start with the identity 


3) 


(4.74) - 


(v+ =) =Ddo | ee ee 
a a 


and integrate, to obtain L+-bounds. Also use 


(4.75) a — DAv<v 
to obtain an L°°-bound on v. (If D > 0, recall Lemma 2.2.) Then pursue stronger 
bounds on wi.) 
4. If the model (4.65) of an epidemic is extended to cases where susceptible and infective 
populations both diffuse, we have 
(4.76) oe = D, Av —rvu, oe = DAw+rvw — au, 
where D,,D,r,a > 0. Establish global solvability for this system, for initial data 
taking values in €*. 
5. Study global solvability for the Brusselator system (4.57), given initial data with values 
in ¢*. (Hint: After getting an L’-bound on v + w, use 
Dt aah eh 
ot 
and an appropriately modified version of the argument suggested for Exercise 3, to get 
a stronger bound on w. Once you have this, use 


(4.77) Zwtw) A(v+w) =(d—1)Aw-—v+a 


to obtain a stronger bound on v + w.) 
6. Consider the following system, modeling a chemical reaction A+ B= C: 


at — Di Aka =c—ab, 
(4.78) b, — D2 Ab = c— ab, 
c, — D3Ac = ab—c. 

Note that X leaves invariant the octant €+ = {a,b,c > 0}. Assume D; > 0. Establish 


the global solvability of solutions with initial data in €*. Assume A is the Laplace 
operator on M, compact, with dim M < 3. (Hint: First get L’-bounds on a + c and 
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b+. c. Then use 
at — D,Aa < Cc, bt _ Dz2Ab < c, 


to get L?-bounds on a and b, for some p > 2 (if dim M < 3). Then use 
cz, — D3 Ac < ab 


to get L?-bounds on c. Continue. Alternatively, apply an argument parallel to (4.77) to 
a+, and relax the requirement on dim /.) 
For a treatment that works for dim M < 5 (and 0M # Q), see [Rot]. 

7. Extend results of this section to the case where L = DA, where D is a diagonal matrix, 
D = diag(d,,...,d¢), and A is the Laplace operator on a compact manifold MW with 
boundary. Consider each of the following boundary conditions: 


(a) Dirichlet, uy | a4 yy, = 9 
(b) Neumann, Ovtj|p+ am = % 
(c) Robin, OvUj — Qj (x)ujla+ xan — 9 


Apply such a boundary condition only if d; > 0. 
Also, consider nonhomogeneous boundary conditions. 


5. A nonlinear Trotter product formula 


In this section we discuss an approach to approximating the solutions to nonlinear 
parabolic equations of the form 


(5.1) —=Iu+X(u), u(0)=f, 


and some generalizations, to be mentioned below, by a process involving succes- 
sively solving the two simpler equations 


Ou Ou 
(5.2) salu ZH 


X(u), 

over small time intervals, and composing the resulting solution operators. If F* 
denotes the nonlinear evolution operator solving the equation Ou/Ot = X(u), we 
seek to show that the solution to (5.1) satisfies 


(5.3) u(t) = lim (e"/ m)L gt/ ”) (f). 

This is a nonlinear analogue of the Trotter product formula, discussed in 
Appendix A of Chap. 11. It is a popular tool in the numerical study of nonlinear 
evolution equations, where it is also called the “splitting method.” 
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We will tackle this via a variant of the analysis used in (A.17)-(A.30) in our 
treatment of the Trotter product formula in Chap. 11. The author came to under- 
stand this approach through conversations with J. T. Beale on the work [BG]. 
Other approaches are discussed in [CHMM]. 

We begin by setting 


k 
(5.4) ae P in) 
and then set 
sL vs k 1 
(5.5) u(t) =e’"F*vux, fort =—+s5, 0<8s<-. 
n n 


Then (under suitable hypotheses on L, etc.) u(t) > ug4i1 ast 7 (k + 1)/n, and, 
fork/n<t<(k+1)/n, 


(5.6) a = Lu + eX (F%vp) = Lv + X(v) + RE), 


where, again fort = k/n+s5,0<s<1/n, 


R(t) = e&" X(F®vx) — X(v) 


me = (€°" — 1) X(F°vp) + [X(Fevg) — X(eF*vg)]. 


To compare u(t) with the solution u(t) to (5.1), set w = v — u. Subtracting 
(5.1) from (5.6) gives 


(5.8) — =lw4+X(v)— X(u) + RUD), 
and if we write 
1 
(5.9) X(v) — X(u) = | DX(su+ (1—s)u) ds = Y(u,v)w, 
0 


we have for w the linear PDE 


(5.10) ae =Lw+ A(t,x)w+ R(t), w(0) =0, 
where 
(5.11) Alt) = ¥ (uz), 0,2), 


which is an ¢ x @ matrix function if (5.1) is an 2 x @ system. 
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We will treat this in a fashion similar to (A.24)-(A.28) in Chap. 11. To recall 
some points that arose there, we expect to show that R(t) is small only in a weaker 
norm than that used to measure the size of v — u. In our first set of results, we 
compensate by exploiting smoothing properties of e’”. Thus, for now we assume 
that L is a negative-semidefinite, second-order, elliptic differential operator. For 
the sake of definiteness, let us suppose L acts on (¢-tuples of) functions on R”, 
with domain 


(5:12) D(L).= B7(R”), 


We will seek to estimate v — u in some V -norm. 
Now, by Duhamel’s principle, 


(5.13) wt) = i et! /A(r)w(r) + R(r)] dr. 


Pick T > 0, 7 € (0,1), and Banach spaces of functions V, W for which we have 
an estimate of the form 


(5.14) le gly <Ct7 |gllw, O<t<T. 


The next step is to estimate the W-norm of R(t), given by (5.7). We have sepa- 
rated this into two parts: 


(5.15) Ry(t) = (e8” —I)X(F8vg), Ro(t) = X (Fug) — X(e8" Fe vp), 


where t = k/n+ 8, 0 < 5 < 1/n. Parallel to (A.26), we need an estimate of the 
form 


(5.16) len” — Ilevw) $C s°, 5>0, 


to estimate R,(t). Of course, this requires that W have a weaker topology than V. 
Granted this estimate, since s € [0,1/n] in (5.15), we obtain 


(5.17) IRi(llw <Cn~ |®i()|lv, 
where 

(5.18) ®1(t,2) = X(Wi(t,x)), 
with 


(5.19) Wi (t,2) = Feup(x) = FE (ve(z)), 
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where F% is the flow on R‘ generated by X, a vector field on R’. Meanwhile, 
(5.20) Ra (t) = 0,(t) — (2), 

where ®,(¢) is as in (5.18)—(5.19) and 

(5.21) Oo(t,x) = X(Vo(t,z)), Wo(t) =e W(t). 

Thus, with Y(u, v) given by (5.9), 

(5.22) Rot) = Z(t(I-—e")Wilt), Z(t) = Y(Vi(t), Wo(t)), 

so, again using (5.16), we have 

(5.23) Ra()llw SC n™ |Z()Icavy IMallv, 

where Z(t) denotes the operation of left multiplication by the matrix-valued func- 
tion Z(t, x). 


There remains the task of estimating the right sides of (5.17) and (5.23). This 
involves estimating the V-norms of 


Up = (coum pun)" p U,(t) = Fup, 
Wo(t) = e*' W(t), ©; (t) — X(;(t)), 


(5.24) 


and the £(W)-norm of Z(t) = Y (®1(t), ®2(t)). Thus, we want the estimates 
(5.25) lle“Iley Se, IF (Ally Se“llfllv, O<tS<T, 


rather than weaker estimates in which e“ is replaced by C,e®. On most of our 
favorite Banach spaces V, e‘” is frequently a contraction semigroup, while the 
second estimate in (5.25) may require more work to establish. Actually, we need 
this second estimate only for || f||y > some constant C,. We get a good estimate 
on all the quantities in (5.24) if the estimates (5.25) hold and also 


(5.26) X:V —> V is bounded, 

where X f(x) = X(f(a)). By (5.26) we mean that X(.S) is bounded in V when- 
ever S is a bounded subset of V. In most examples, X will be locally Lipschitz, 
which is more than sufficient. Granted these hypotheses, to get a good bound on 
Z(t) it suffices to have that 

(5.27) Y:V x V — L(W) is bounded, 


where 2)(f, g)(x) = Y (f(x), g(x)). Let us summarize this analysis: 
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Proposition 5.1. Let V and W be Banach spaces of (€-tuples of) functions for 
which e'” satisfies the estimates 


(5.28) lel) Se*, lle Ileawv) < C0, Mle — Ilewvywy < Ct’, 


for0 <t <T, with some 6 > 0,0 <7 < 1. Let X be a vector field, generating 
a flow F& on R*, satisfying 


(5.29) IF°(Allv < Ce, for \Ifllv < Ci, 

and, for || f||v = Ci, 

(5.30) IF (Allv < e“llfllv, 

for0<t<T, where F' f(x) = Fi (f(x)). Assume also that 

(5.31) *:V > VandQ:V x V > L(W)N L(V) are bounded, 


where Xf (a) = X(f(x)) and 


N(F,9)() =¥ (f(a), 9()) = | DX (ag(a) + (1-2) f(a)) ds. 


IffEeVv, we C((0,7],V) is a solution to (5.1), and v € C((0,T], V) is defined 
by (5.4)-(5.5), then, for0 <t <T, 


(5.32) llv(t) — uIlv < C((Ifllv) -n-?. 
Proof. The hypothesis (5.31) also yields an £(V)-bound on A(7) in (5.13), so 
we have 
t 
(5.33) lw@)llv S af el) |lw(r)|Iv dr + ||FOIIv, 
) 


where A is a constant and 


F(t) = | et" R(r) dr. 


From the hypotheses (5.28)-(5.31) and the consequent estimates in (5.17) and 
(5.23), we have 


(5.34) ||F(@llv < c(t) f (f—7)77 dr-n~ = Bi(|fllv) 2-87. 
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Thus 3(t) = ||w(¢)||v satisfies the integral inequality 
t 
(5.35) B(t) <A | e"—7) B(7r) dr + B-n-*t!-7, (0) =0, 
0 


where A and B depend on || ||. The conclusion (5.32) follows from Gronwall’s 
inequality. 


As one simple example of useful Banach spaces, we consider 
(5.36) V = BC'(R"), W=BC°(R®), 


where, as in (4.14), BC* (R”) denotes the space of functions whose derivatives of 
order < & are bounded and continuous on R” and extend continuously to the com- 
pactification R” via the sphere at infinity (approached radially). This is a subspace 
of the space BC*(R"), consisting of functions whose derivatives of order < k are 
bounded and continuous on R”. We want the functions to take values in R¢, but we 
suppress that in the notation. Suppose L is a constant-coefficient, second-order, 
elliptic, self-adjoint operator. If (5.1) is an € x & system, let us hypothesize the 
invariance property (4.4), so, with an appropriate norm on R*, we have that e'” 
is a contraction semigroup on V (and on W). Furthermore, the other estimates in 
(5.28) hold in this case, with y = 6 = 1/2. 
To investigate the estimate (5.30), we have 


ay (Fi fillect = IF Fllux + IDF Pll 
= sup || F%(f(2)) ne + Sup ||DF*(F) ° DF(@) Ip 


Thus (5.30) holds, provided 


(5.38) IFx(yllee < e“llulignes, OS*<T, 
and 
(5.39) IDF&Wlleay Se", O<t<T. 


As shown in § 6 of Chap. 1, DF{,(y) = G'(y) satisfies the linear ODE 


(5.40) <G'y) = DX(Fx(y))G'(y), Gy) = TL. 


Consequently, it is clear that (5.38) and (5.39) hold as long as X is a C'-vector 
field on R’, satisfying 


(5.41) IX(y)\Iee Se, I|]DX (Iles < ¢ 
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These conditions are also enough to yield the boundedness of the maps X : V > 
V and: Vx V > L(W), in (5.31), but in order to have boundedness of 
9:VxV—- L(V), we need C!-bounds on Y(f,g), hence C!-bounds on DX. 
In other words, we need X € BC?(IR°). We have the following result. 


Proposition 5.2. Let u € C([0,T], BC*(IR")) solve (5.1), and let v(t) be defined 
by (5.4)-(5.5). Assume that L is a constant-coefficient, second-order, elliptic oper- 
ator, generating a contraction semigroup on BC° (IR") and that X is a vector field 
on R*¢ with coefficients in BC?(R°). Then, for any bounded interval t € |0,T], 


(5.42) Ilu(t) — v(@)[lect < C(Ilflleer) -n-1/?. 


As another example of Banach spaces to which Proposition 5.1 applies, 
consider 


(5.43) V=H*(R"), W=H*-2(R"), k>S, 0<7<1. 


Assume k € Z*. Then (5.28) holds, with 6 = -y. We have the Moser estimate 


(5.44) |X (lle $ Ce(Iiflle~) «(14 fll); 
where 
(5.45) Cy (A) = Cy, sup {X)(f) [FL <A, [Hl < B}. 


Thus (5.31) is seen to hold as long as X € BC*(R*). To see whether (5.30) holds, 
we estimate (d/dt)||F' f||7,., exploiting (5.44) to obtain 


d 
Gl F lle = 2(X(FD) FF) a 


(5.46) 
< Col F*flle~) (IF fle + IF“ Ilr): 

Now for ||F'f || 7* > 1, the right side is < 2Cx(\|F'f\|L~)||F |? 4. If X € 

BC*(R°), there is a bound on 2C;,(||F' ||, ) strong enough to yield (5.30). We 

have the following result: 


Proposition 5.3. Assume k > n/2 is an integer. Let u € C((0, T], H*(R")) 
solve (5.1), and let v(t) be defined by (5.4)-(5.5). Assume that L is a constant- 
coefficient, second-order, elliptic operator, generating a contraction semigroup on 
L?(R"), and that X is a vector field on R° with coefficients in BC*(R°). Then, 
for any bounded interval t € {0,T], 


(3.47) I|u(t) — v(é) lle SC (If lla) 277, 
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for any y < 1. Furthermore, for any € > 0, 
(5.48) l|u(t) — v(t)llae-2 S Ce (lf lle) 27". 


It remains to establish (5.48). Indeed, if we set W = H*? (R”), we easily get 
R(t) |G < Cn, via use of |e” — Tl eww) < ct, instead of the last estimate 


of (5.28). Then we can use V = He (R”), replacing the second estimate of 
(5.28) by |le’”|| caw.) < C.t~“-€/?), and parallel the analysis in (5.33)-(5.35) 
to obtain (5.48). 

It is desirable to have product formulas for which the existence of solutions to 
(5.1) is a conclusion rather than a hypothesis. Suppose that v, given by (5.4)-(5.5), 
is compared, not with the solution u to (5.1), but to the function v, constructed by 
the same process as v, but using intervals of half the length. Thus, for an integer 
or half-integer k, define 


2k 
(5.49) aig (ene) (7), 
and set 
~ sLqos~ k 1 
(5.50) v(t) =e"F* vy, fort =—+5,0 <s<—. 
n 2n 
Parallel to (5.6), we have 
Ov _ _ a 
(5.51) Y= 10+.X(v) + Rit), 
ot 
where, fort = k/n+ 8,0 <s <1/2n, 
(5.52) R(t) = (8 — IX (F8 Gq) + [X(F% Gq) — X (eX F? })]. 


Consequently, w = v — U satisfies the PDE 


Ow 


3p = Lit A(t, xz) + R(t) — R(t), @(0) =0, 


(5.53) 
where, parallel to (5.11), 
(5.54) A(t,z) =Y (0(t, 2), v(t, 2). 


Pick Banach spaces V and W as above, and assume f € V. As long as the 
hypotheses (5.28)-(5.31) hold, we again have 


(5.55) |R(t) — R(t)|lw < Cn, O<t<T. 
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We also have bounds on ||v(¢)||y and ||v||y, independent of n, hence bounds on 


A(t, x), so the analysis in (5.33)-(5.35) extends to yield 
(5.56) v(t) —dH|lv < Cn, O<t<T. 


Consequently, if we take n = 2) and denote v, defined by (5.4)-(5.5), by U(j)> 
so ¥ is logically denoted u(;41), we have 


(5.57) {ug : J € Zt} is Cauchy in C((0, 7], V), 


and the limit is seen to satisfy (5.1). 

There are some unsatisfactory aspects of using the smoothing of e*” that fol- 
lows when LF is elliptic. For example, Propositions 5.1-5.3 do not apply to the 
Fitzhugh—Nagumo system (4.2), since the operator L given by (4.3) is not elliptic. 
We now derive a convergence result that does not make use of such a hypothesis; 
the conclusion will be weaker, in that we get convergence in a weaker norm. We 
will establish the following variant of Proposition 5.1: 


Proposition 5.4. Let V and W be Banach spaces of (€-tuples of) functions for 
which e'” satisfies the estimates 


658) lela) <Se%, lle llean Se, llet® — Tlewany $ C8, 
for 0 <t < T, with some 6 > 0. Let X be a vector field on R‘, generating a 
flow Fx, whose action on functions via F' f(x) = F%(f(«)) satisfies (5.29) 


(5.31). Take f € V. Then (5.1) has a solution u € C((0, T], W), and the function 
VE C((0, TI, V) given by (5.4)-(5.5) satisfies 


(5.59) v(t) — ul(t)|lw < Cn, O<t<T. 


Proof. If v and ¥ are defined by (5.4)-(5.5) and by (5.49)-(5.50), we will show 
that 


(5.60) sup ||v(t)|lv < B, 
0<t<T 


with B independent of n, and that 
(5.61) v(t) — (lw <Cn~, O<t<T. 


In fact, the hypotheses (5.58) together with (5.29)-(5.30) immediately yields 
(5.60). If we also have (5.31), then there is the estimate 


(5.62) |R@|lw < Cn, ||RO)\lw < Cn, 
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established just as before. Again, w = v — v solves the PDE (5.53), and hence, 
parallel to (5.33)-(5.34), we have 


t 
(5.63) IEW <A fe] GE) dr + C'n-F, |f(O) ww = 0. 
0 
Thus Gronwall’s inequality yields (5.61), and the proposition follows. 
Note that in Proposition 5.4, we can weaken the hypothesis (5.31) to 


(5.64) *%:V —>V and2): V x V > L(W) are bounded, 


omitting mention of L(V). Let us also note that the limit function u € 
C([0, T], W) also satisfies 


(5.65) we L™ (0, T], V), 
provided V is reflexive. 


Proposition 5.4 essentially applies to the Fitzhugh-Nagumo system (4.2), 
which we recall: 


2 
ap aria 
ee 2. (uv — yw) 
aN 


As we did in §4, we modify the vector field X(v,w) = (f(v) — w,e(v — yw)) 
outside some compact set to keep its components and sufficiently many of their 
derivatives bounded. 


Exercises 
1. Investigate Strang’s splitting method: 


u(t) = kim, ae aia opt oy f). 
Obtain faster convergence than that given by (5.48) for the splitting method (5.3). 

2. Write a computer program to solve numerically the Fitzhugh-Nagumo system (4.2), 
using the splitting method. Take M = S*. Use (4.32) to specify the constants 7, a, and 
e€. Alternatively, take y = 10. Try various values of D. Use the FFT to solve the linear 
PDE 0v/dt = Dd?2v, and use a reasonable difference scheme to integrate the planar 
vector field X. 
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6. The Stefan problem 


The Stefan problem models the melting of ice. We consider the problem in one 
space dimension. We assume that the point separating ice from water at time ¢ is 
given by x = s(t), with water, at a temperature u(t, x) > 0, on the left, and ice, at 
temperature 0, on the right. Let us also assume that the region x < 0 is occupied 
by a solid maintained at temperature 0. In appropriate units, u and s satisfy the 
equations 


(6.1) Ut = Use; u(0,2) = f(x) 
u(t,0)=0, u(t, (8) = 0, 

where 0 < x < s(t) and 

(6.2) §=—auy,(t,s(t)), s(0) =1. 

We suppose f is given, in C™(I), I = [0,1], such that f(a) > 0 and f(0) = 

f(1) = 0. In (6.2), a is a positive constant. 


It is convenient to change variables, setting v(t, x) = u(t, s(t)x), forO<a<1. 
The equations then become 


ut = s(t) 2022+ 2 r, (0,2) = f(2), 
Ss 


(6.3) 

u(t, 0) = 0, u(t, 1) =0, 
and 
(6.4) $= -< vp(t, 1). 


Note that (6.4) is equivalent to (d/dt)s? = —2av,(t, 1), so if we set E(t) = s(t)”, 
we can rewrite the system as 


(6.5) vu, = €(t)~ Vee + set v(0,x) = f(x), v(t,0) = v(t, 1) = 0, 


(6.6) E(t) =—2av,(t,1),  €(0) = 1. 
Note that the system (6.5)—(6.6) is equivalent to the system of integral equations 
t 
(6.7) u(t) = e% 04 py | B(r)e™4 (wun(r)) dr, 
0 


t 
(6.8) é(t) =1—- 2a | Uz (7, 1) dr, 
0 
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where 


“ft ft gay 80) _ 180) 
6) aen= | Caaf aap B= 35 -ae0) 


Here, e' is the solution operator to the heat equation on R+ x J, with Dirichlet 
boundary conditions at z = 0, 1. 

We will construct a short-time solution as a limit of approximations as follows. 
Start with 9(t) = 1 — 2af’(1)t. Solve (6.5) for v1 (t,x), with € = €). Then set 
&i(t) =1- 2a is 0,01(T, 1) dr. Now solve (6.5) for vo(t, x), with € = €;. Then 
set €2(t) = 1 — 2a In 0,v2(T, 1) dr, and continue. Thus, when you have €,(t), 
solve for vj+1(t, x) the equation 


1é; 
— 7). 2), =A. 25d. : ; = 
(6.10) att Gi(t) Onvj41 + 5 g, 7 Ont v341(0,2) = f(x), 
vj41(t, 0) = us41(¢, 1) = 0. 
Then set 
t 
(6.11) &)41(t) =l1- 20 f Orv; 41(T, 1) drt. 
0 


Lemma 6.1. Suppose €;(t) satisfies 


(6.12) (0) =1, €(0)=-2af'(1), & >0. 

Then &;+1 also has these properties. 

Proof. The first two properties are obvious from (6.11), which implies 
(6.13) &41(t) = —2a 0205 41(t, 1). 

Furthermore, the maximum principle applied to (6.10) yields 

(6.14) upalt,2) = 0. 

Since v;+1(t, 1) = 0, we must have 0,v;+1(#, 1) < 0. 


The PDE (6.10) for vj;41 is equivalent to 


t 
(6.15) vj41(é) = eri (LOA F +f By(r)ewseDA (20,0;41(7)) dr, 
0 


where 
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"dn Pewee sic) 
Gay? PO = aE 


(6.16) a,;(t,T) = 
One way to analyze e’ on functions on J is to construct S', the “double” of 

I, and use the identity 

(6.17) e'4g = ped (Og), 

where Og is the extension of g € L?(I) to Og € L?(S'), which is odd with 

respect to the natural involution on S$! (i.e., the reflection across OJ), and pG is 

the restriction of G € L?(S") to I. It is useful to note that 


(6.18) O:Ci(1) 3 C'(S"), ford <r <2, 


where Cf (I) is the subspace of u € C’(Z) such that u(0) = u(1) = 0. Ifr = 
1+yp, 0 <p <1, then C"(I) = C!(J). Furthermore, 


(6.19) 626" Os"). 
It is useful to note that 
(6.20) e* (Org) = OreNg if g € C*(1), 
where egA is the solution operator to the heat equation on Rt x J, with Neumann 
boundary conditions, as can be seen by taking the even extension of g to S?. 
Hence (6.15) can be written as 
vs (8) = eA F 


(6.21) t e 
+ J Bj(r) (Bxei# 9 My — e964 osya(r) dr 
0 


where M,, is multiplication by z. In analogy with (6.17), we have 
(6.22) eng = pe™(Eg), 
where Eg is the even extension of g to S'. In place of (6.18)—(6.19), we have 


E:C'(1I) > C(S'), for0<r<1, 


6.23 
van) E:0%l(L) — C%1(81), 


We now look for estimates on vj,, and €;41. The simplest is the uniform 
estimate 


(6.24) lluj+i()Ilz~ < [fllae, 
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which follows from the maximum principle. Other estimates can be derived using 
(6.15) and (6.21) together with such estimates as 


(6.25) lle“ gllor S Ct IIg|lz~, le gllor < CH"? [Ig I z=, 


valid for any f € D©(I), any r > 0, t € (0, T], with C = C(r, T). Hence, using 
(6.21), we obtain, for 0 < ys < 1, an estimate 


(6.26) Iloj4i()llow < lle fllow + Aj (If lln~, 
where 
(6.27) =A fv B(7 —A1+H)/2 gy, 


Now, by (6.16), a;(t,7) > &(t)7!(t — 7), granted that €; > 0, so 


Aj, (t) < AE, (t)+M)/? * &(r) iz (t—7r)7C+)/? dr 
(6.28) ae 0 &(7) 


< BE, (Or? sup &(7)| pa-w)/2, 
O<r<t 


Now we can apply (6.21) again to obtain, forO<r<1,0<p<1,u+rAl, 


(6.29) lluj41(8)|| utr < le?" flax + Aj,(t) aoe \|v;41 (7) lon, 


using 


Ile“ gl cut < Ce" glee, rou, HE (0, 2), gE Cet), 


(6.30) tA —r/2 Tr 
len Iloute < Ct lgllce, r>0, hE (0,1), gECc (1), 


fort € (0, 7]. If w +r = 1, it is necessary to replace C1 by the Zygmund space 
C!. Combining this with (6.26), we obtain, for 


(6.31) Njr(t) = sup |lv;(7)Ilor, 
O<r<t 


the estimates 


Nj+1u4r(t) S Collfllontr + CrAgrOllfllow 


(6.32) 
+ CrAjr(HAju Ol flee: 


Recall that 
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(6.33) (7) < 2allv; (7) IIc. 
Hence 


where Nj; (t) is the case r = 1 of (6.31). Therefore, by (6.28), 
(6.35) Ajy(t) $ 2aB[1 + a(1 + WN Oe] Nja(eO-?, 
Consequently, taking r = p € (1/2,1), so 2 € (1,2), we have 
(6.36) Nya1j2n(t) < P(N (tt, Nae”), 


where P(X, Y) is a polynomial of degree 4, with coefficients depending on such 
quantities as || f'||c¢2., but not on j. A fortiori, we have 


(6.37) Ny+ia(t) < P(Njr(t)t, Nj (tG-/?). 
Such an estimate automatically implies a uniform bound 
(6.38) Nyi(t) < K, fort € [0,7], 


for some T' > 0, chosen sufficiently small. Appealing again to (6.36), we conclude 
furthermore that there are uniform bounds 


(6.39) Njop(t) < Ky, fort € [0,7], 2 <2. 

That is to say, 

(6.40) {u; :j € ZT} is bounded in C([0,T],C"(I)),  r <2. 
Taking r = 1, we conclude that 

(6.41) {€; : j € Z*} is bounded in C*({0,7]). 


Of course, we know that each €,(t) is monotone increasing, with €;(0) = 1. 
From (6.40) and (6.18), we have 


(6.42) {Ov,; : 7 € Z*} bounded in C([0,T],C"(S")),  r <2. 


Also, {E(xv;) : 7 € ZT} is bounded in C((0, 7], C°(S*)), so we deduce from 
(6.10) that 
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(6.43) {0,(Ov,) : j € Z*} is bounded in C([0,7],C,°-"(S")), <2. 
Interpolation with (6.42), together with Ascoli’s theorem, gives 

(6.44) {Ov,; :€ Z*} compact in C? ([0, T],C7~77(S")), 

for r < 2, a € (0,1). It follows that 

(6.45) {vj : 7 € ZT} is compact in "ar, Cl(1)), forall 5 > 0. 
Consequently, (6.41) is sharpened to 

(6.46) {€; : 7 € ZT} is compact in Cr (0, T)), 

for all 6 > 0. It follows that {v,} has a limit point 


(6.47) ee fy. COT," a), 
0<o<1,6>0 


and {€;} has a limit point 


(6.48) €€ () c%/?-9((0,7)). 


6>0 


It remains to show that such v,€ are unique and give a solution to the Stefan 
problem. 
To investigate this, choose Co(t) satisfying 


(6.49) Go(0)=1, (0) =—2af'(1), ¢o = 0; 


define w(t, 2) to solve (6.5) with € = Co; then set 


t 
6(t) =1— 2a f O,w1(T, 1) dr, 
0 


and continue, obtaining a sequence w,;,¢;, 7 € Z*, in a fashion similar to that 

used to get the sequence v;,€;. As in Lemma 6.1, we see that each ¢; satisfies 

(6.12). Now we want to compare the differences v; — w, and €; — ¢; with vj41 — 
wy41 and €541 — ¢j41. Set V = vj41 — wy41. Thus V satisfies the PDE 

av 16; oe 

—— =f, Vex ave + (= a 

Ot Z 2 &; & Sj 

(6.50) ; ; 

§ = 6 xO Wi+1 

G&G es 


) Ova Wj +1 
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together with 
(6.51) V(0,2)=0, V(t,0) = V(t,1) =0. 


Note that the analysis above also gives uniform estimates of the form (6.40)- 
(6.46) on w; and ¢;. Now, for V we have the integral equation 


V(t) = | Bj(r)e)4 (2,.V (7) dr 


o Ge 7 aa) eA; 41 (T) dr 


1 ft (S30) _ G9) eH 4h (2, wi41(7)) dr 
af (#8 zat ne 


= $,4+ So4+ Ss, 


(6.52) 


where 3;(7) and a,;(t,7) are as in (6.16). As in (6.21), we replace the integrand 
in S; by 


(6.53) 8;(r) (ane5f 7" Me = ensKeele) V(r). 


Thus 


I|Sillor < a| Bj (r)ag(t,7)- 7 ||Vr)llon ar 


(6.54) < BE (t)"? ou (| | (t— 1)? |Vr)llon ar 
O<r<t 0 


<Ct? sup |ViAllo, 
O<r<t 


provided 0 < ¢ < JT’, with T’ small enough that the uniform estimates on €; and 


& apply. 
It does not seem feasible to get a good estimate on $2 in terms of the C!-norm 
of w;+1, but we do have the following: 


t 
|Sallo1 < al léi(7) — ¢j(7)| ay (t, 7) O-? |Jwz4a(r)\|orte dr 
(6.55) ° 
<C sup |é(7)—¢(7)|- sup |wjyi(T)Ilorsn th”, 
O<r<t O<r<t 


for any ys € (0, 1). Finally, 
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G(T) _ &y(7) 


6.56 S3llo1 <C 
( ) | alle = sup é;(7) G(r) 


O<r<t 


| p/2, 


sup ||wj+1(7)| 
O<Tr<t 


Consequently, for ¢ < T sufficiently small, 0 < pw < 1, we have 


1 
eee IV r)Ilow 
O<TS 
(6.57) SC sup [s(r)— Gr) sup lhossa(r)llorrn 0? 
Ej(r)  ¢;(r) i 
+C = : ge 
Se) OG) aoe lwe+a(7)llo 
Therefore 
sup [&+1(7) ~ G+(7)| 
O<r<t 
ey <C sup [&j(7) — G(r)|- sup llwjsr(r)Ilorre 0”? 
O<7r<t 0<r<t 


+C sup |&(7)—G(7)|_ sup _|lwj4i(r) Io 2”. 
O<r<t O<r<t 
It follows easily that, for T’ small enough, 


(6.59) lf — Gallex(o,ry) 4 © and |lv; — wyllcqo,r),c1~)) > 9; 
as 7 —> oo. Thus we have the following short-time existence result: 
Proposition 6.2. Given f © C™(I), f > 0, f(0) = f(1) = O, there are a 


T > 0 and a unique solution v, € to (6.5)-(6.6), satisfying (6.47)(6.48). Hence 
there is a unique solution u, s to (6.1)-(6.2) on0 <t < T, satisfying 


(6.60) ue C7([0,T],C77(D), s¢c%?-*(0,T]), #20, 
forallo €{0,1),r<2,6>0. 

We want to improve this to a global existence theorem. To do this, we need 
further estimates on the local solution v, s. First, it will be useful to have some 


regularity results on v not given by (6.60). 


Lemma 6.3. The solution v of Proposition 6.2 satisfies 
5 
(6.61) v €C((0,T], A" (1)) NC" ([0,T],H"~7(1)), forallr < 5 


Proof. This follows by arguments similar to those used above, plus the following 
variant of (6.18): 
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s 8 1 1 

(6.62) O: H*(I) — H*(S°), for0<s< 5. 
We know from (6.60) that xv,(7) is continuous in 7 € [0,7] with values in 


C!-<(I) c H'-2*(I1) C H2~£(1), so use of the integral equation (6.7) gives 
(6.61). 


We now take a look at 


1 
(6.63) h(t) -| u(t, x) da, 
0 


which, by (6.14), is ||u(t)||,1. We have 


1 2 pl 
(6.64) < = al 07 v(t, x) ae+5e x0, v(t, 2) dx, 


and integrations by parts give for the right side: 


1 E 1é 7? 
eo ee mE) = ae t,x) d 
os z( Bag ) is | we) de 


Consequently, 


1 
6.66 —(sh) = —- — —0,v(t,0), 
(6.66) (sh) =—~ — [dev(t, 0) 
and integration of this gives 


(6.67) sh) + 28) + f Ta.v(r.0) ar = + flo) ae 


a a 


From (6.14), 0,,v(t,0) > 0, so each term on the left side of (6.67) is positive. This 
gives the upper bound in the two-sided bound, 


1 
(6.68) 1 <s(t)<1+ af f(x) dx = A, 
0 


the lower bound following from the monotonicity s > 0. Thus 1 < &(t) < A’, 
Hence, in (6.7)—(6.9), we have 


(6.69) A(t—T) < a(t,r) <t—r. 
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We can likewise examine (d/dt)||v(t)||?, but since that analysis won’t be cru- 
cial for our existence theorem, we leave it to the exercises. 

We next look at the rate of change of ||0,,v(t)||7.2. Since O,v = 0 for x = 0, 1, 
we have 


“iaev(t) lz = (0,.0,0, O20) = — (Av, 020) 


1 
2 
g 
g 


(6.70) 1 1 
2,,||2 2 

= ~glleeellie 5 (xd,v, 020), 
and an integration by parts, plus use of 0,v(t, 1) = —€(t)/2a, gives 


gig 
gat 7 lors. 


d 2 é 
(61) Zd.0(0) 2 = —ZllO2elhia — $- 
or, equivalently, 
ie 


djl 2 2 2 
(6.72) = 5 (-lleellz2) = ~¢lleeelle ~ €° Bat’ 


Since the right side of (6.72) is < 0, this gives, upon integration, 
(6.73) lnv()lli2 < sx lize. 


Note that, by (6.68), the right side is < Al|0, f||7,2, which is independent of t. 
Using (6.73) and (6.7), we have, for r € [1, 2), 


t 
(6.74) u(t) lar < le" fll + Celia f Blr)att,7)-"? ar, 
0 


and the first term on the right is < || f|| 7, since 
fe (DNA) => Of ¢ a(S), 


for r < 5/2. By (6.69), we deduce that, for r € [1, 2), 
t 
(6.75) o(é)llae < Cr+ Ce | a(n) (t—1)-"? ar, 
0 


with C; = C;(r)||f\|a- independent of ¢. Taking r € (3/2,2), and using 
$(t) = —vz(t, 1)/2a, which is < Cllv|| a, we deduce that 


t 
(6-78) 3(t) < Ki + Ke | 3(r)(t — r)77/? dr. 
0 
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From this we can establish the following important estimate: 


Lemma 6.4. [fv solves (6.3) forO <t < T and satisfies (6.61), then 


(6.77) sup 8(t) < Ko, 
0<t<T 


where Ko = Ko(||f\la~) (r¥ = 38/2 + €) is independent of T. 


Proof. Pick p > 0 small enough that f/ t~"/? dt < 1/2K. Thus, writing the 
interval [0,t] as [0,¢ — p] U [t — p, t], we have 


t 
(6.78) Ka | 8(r) (t—7)7"? dr < 5 sup 3(r) + Cop-"/? [s(t — p) — 1]. 
0 


TA 


We conclude from (6.76) and (6.68) that 


1 
(6.79) sup 4(t)<= sup 4(t)+ Ky, + Kop7"/allf\ln, 
0<t<T 2 o<t<r 


which gives (6.77). 


Returning to (6.75), we deduce that the solution to (6.3) given by Proposition 
6.2 satisfies, for any r € [1, 2), 


(6.80) leOllareay < Kr, OStS<T, 


with KC, independent of T. We know that xv,(7) has an H*-bound for any s < 1, 
and, via (6.62), we can use such a bound on xv,(7) for s < 3; to conclude, via 
(6.7), that (6.80) holds for any r € [1,5/2). Now familiar methods establish the 


following: 


Theorem 6.5. Given f € C™(I), f > 0, f(0) = f(1) = 0, there is a unique 
solution v, € to (6.5)-(6.6), defined for all t € (0,00), satisfying 


(6.81) v € C([0,00), H”(1)) NC*([0, 0), H” (1), r< a 
and 
(6.82) €€C*([0,00)), p< . 


We now tackle the task of showing that wu and s, or equivalently v and €, are 
smooth for t € (0,00) = J. It is convenient to set 


(6.83) V(T, x) =v(t,z), T=a(t,0) = on dr, 
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so OrV(T, x) = €(t)0;v(t, x), and we have 


OV i 
(6.84) BT = Vig + o(T)eVe, o(T) = 5S (t), 
with 
(6.85) V(0,«) = f(x), V(t,x) =0, fora € OL. 


Note that (6.6) is equivalent to 
(6.86) o(T) = —aV,(T, 1). 


In place of (6.7), we use 


T 
(687) vety= ep f o(r)el™4 (ever) ar. 
0 


The results (6.81) and (6.82) imply 


(6.88) 
V € C((0, 00), H"(I)) NC1((0, 00), H"-2(1D), +o € C'/?-4([0, 00), 


for any r < 5/2, 5 > 0. Consequently, V € C1/?-9([0, 00), H3/2-9(1)) for all 


5 > 0, so o(r)2V,(r) € Cl/?-9([0, 00), H1/?-*(1)). The following lemma is 
useful. 


Lemma 6.6. Suppose 
T 
G(T) = ef —-7)4 Bir) dr. 
0 


Then, for any r > 0, s € (0,1/2), 
F € C((0, 00), L?(1)) NC" ((0, 00), H°(1)) => G € C"((0, 00), H**?-*(D)), 
for all 6 > 0. 


The proof is straightforward. Applying this to F(7) = o(r)aV,.(7), we deduce 
that the right side of (6.87) belongs to C!/?—° ((0, 00), H®/?-9(Z)), for all 6 > 0. 
Thus, with J = (0,00), 


(6.89) Veor (1H? ()). 


Note that this is stronger than the first inclusion in (6.88). Making use of the PDE 
(6.84), we deduce that Vr € C1/?-°( J, H!/2-9(1)), so 
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(6.90) vec? (A *(7). 
Interpolation of (6.89) and (6.90) gives 
(6.91) Ver "(2 @)) 
hence, by (6.86), 
(6.92) cer "Uh 
Now we have improved all three parts of (6.88), essentially increasing the degree 


of regularity in T by one half, at least for T € J = (0, co). Iterating this argument, 
we obtain 


Veo (LPP Dn) neh? sw? O), 


(6.93) 
a € QU28/2-5( 7), 


for each j € Z*. We are well on the way to establishing the following: 
Proposition 6.7. The solutions v, € of Theorem 6.5 have the property 

(6.94) v €C™((0,00) x I), € €C™((0,00)). 

Proof. That € € C™(.J) follows from o € C%(.J). For the rest, it suffices to 


show that V € C™(J x I). We get this from (6.93) together with the PDE (6.84). 
In fact, this yields 


(6.95) Vex CO? (LAPD), 
hence 
(6.96) Veort, A (1). 


Iterating this argument finishes the proof. 


A number of variants of (6.1)—-(6.2) are studied. For example, one often sees a 
nonhomogeneous boundary condition at the left boundary: 


(6.97) u(t, 0) = g(t). 
Or the boundary condition at x = 0 may be of Neumann type: 


(6.98) uz (0,t) = A(t). 
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Both the PDE and the boundary conditions may have t-dependent coefficients. 
For example, the PDE might be 


(6.99) up = A(t)Uae- 


Some studies of these problems can be found in Chap. 8 of [Fr1] and in [KMP], 
where particular attention is paid to the nature of the dependence of the solution 
on the coefficient A(t), assumed to be > 0. 

There are also two-phase Stefan problems, where the ice is not assumed to be 
at temperature 0, but rather at a temperature u;(t,2) < 0, to be determined as 
part of the problem. Furthermore, these problems are most interesting in higher- 
dimensional space. More material on this can be found in [Fr2]. 


Exercises 
1. If v solves (6.3)-(6.4), show that 


d 2 
5 (slle@llz2) =— Slee Ize: 


hence 


t 
= 2 
s(t)|lv)|IZ2 +2 f s(r)~*||ve(7)||p2 ar = lI flZ2- 
) 
2. If wu satisfies (6.1)—(6.2), show that 


1 s(t) 
(6.100) s(t)? =1+4 2a | xf (x) dx — 2a [ xvu(t, x) dx. 
ty) 0 
Compare the upper bound on s(t) this gives with (6.68). 
Show that, conversely, (6.1) and (6.100) imply (6.1)—(6.2). This result, or rather its ana- 
logue in the more general context of the nonhomogeneous boundary condition (6.97), 
played a role in the analysis in [CH]. 
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In this section we begin to study the initial-value problem 


Ou 


(7.1) a 


=> A! (t,x, Dhu) 0j0pu+ Bt,c, Diu), u(0) = f. 
j,k 


Here, u takes values in R“, and each AJ* can be a symmetric K x K matrix; 
we assume AJ* and B are smooth in their arguments. We assume for simplicity 
that x € M = T”, the n-dimensional torus. Modifications for a more general, 
compact M/ will be contained in the stronger analysis made in § 8. We impose the 
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following “strong parabolicity condition”: 


(7.2) >> AM (t, 2, DiwésEx > Colél?!, 


ik 


where to say that a pair of symmetric K x K matrices S; satisfies S; > Sy is to 
say that 5; — Sy is a positive-semidefinite matrix. 

We will use a “modified Galerkin method” to produce a short-time solution. 
We consider the approximating equation 


0 L 
7a = JS” AP*(t,2, Di Jete)Oj;OpTete + JeB(t, x, Di Jette) 
Oe) = JeLeJeue + JeBe, 
ue(0) = Jef. 


Here J- is a Friedrichs mollifier, which we can take in the form 
Je — y(e V —A); 


with an even function y € S(R), y(0) = 1. Equivalently, the Fourier coefficients 


f(k) of f € D’(T”) are related to those of J. f by 


(Jef) (k) = p(elkl) f(), KE Z". 


For any fixed ¢ > 0, the right side of (7.3) is Lipschitz in u, with values in 
practically any Banach space of functions, so the existence of short-time solutions 
to (7.3) follows by the material of Chap. 1. Our task will be to show that the 
solution u_ exists for ¢ in an interval independent of ¢ € (0, 1] and has a limit as 
€ \, 0, solving (7.1). 

To do this, we estimate the H’-norm of solutions to (7.3). We begin with 


d 
(7.4) ql Pe ue (lize = AD" Jebeletin, Di.) a 2(D° Bz, D° Jeug). 


Since J- commutes with D® and is self-adjoint, we can write the first term on the 
right as 


(7.5) (LD Jetix, D? Li) OU D™, DA Jeti, DA Jets). 


To analyze the first term in (7.5), write it as 


(7.6) &(Lev,v) = -2 5° ( AI jv, pv) + 2S °([A2*, Oe] Oj, v), 


ik 
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where AZ* = AJ*(t,x,D1J-u-) and v = D*J_-u-. Note that, by the strong 
ellipticity hypothesis (7.2), we have 


(7.7) S ° (Ad dj0, ev) > CollVulli2- 
The commutator [D°, L-] = )>[D®, A2*] 0,0; can be treated using the Moser 


estimates established in Chap. 13. From Proposition 3.7 of Chap. 13 we deduce 
that 


I|[D°, Le}wllz2 
8) < CY (JAM |a0jcell~ + IVA = [0j9x0 lr), 
j,k 
provided |a| < @. Since [A2*, 0,]w = —>>(0;A2*)w, we have the elementary 
estimate 
(7.9) II[A2*, AJOjvllz2 < CVA |[z~ ljullz2- 


Furthermore, Proposition 3.9 of Chap. 13 implies 
(7.10) |A2* || pe < Ce([Jetellor) (1 + I Jewell aes), 


and we have the elementary estimate ||V.AJ*||,~ < C(||Jeuellc1) ||Jeuelloz- 
Hence (7.5) is less than or equal to 


—2Cp||VD° Jeue(t)||Z2 


(7.11) ‘4 
+ C(||Jeve|lcr) || Jee llor (1 + || Jette || ze+1) : ||D JeUe||p2. 


Furthermore, we have a bound 
(7.12) 2(D* Bs, D” Jette) S C(|[Jeuellor) (1 = || Jette || pre+1) : ||D° Jeue|| x2, 


by the analogue of (7.10) for || B(t, 2, Di J-u-)|| ze. Consequently, we have an 
upper bound for (7.4). Summing over |a| < @, we obtain 


d 
alte Ollize S — 2Col| Feteell fers 


+ C1 (|[Jette||oz) (1 + || Jete|| zre+1) || Jewell we. 


(7.13) 


Using AB < CyA? + (1/4C) B?, with A = || J-ue|| ze+1, we obtain 


d 
(7.14) Gls Olle < —Co||Jeve|| Fre+1 a C2(||Jevello2) (|| Jevtel| Fre Ly 
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In particular, since { J. : 0 < € < 1} is uniformly bounded on each space C)(M) 
and H*(M), we have an estimate 


d 
(7.15) lle )llize S Ce(ee lle) (Ite ()llire + 1).- 
This estimate permits the following analysis of the evolution equations (7.3). 


Lemma 7.1. Given f € H*, € > n/2 + 2, the solution to (7.3) exists for t in an 
interval I = |0, A), independent of €, and satisfies an estimate 


(7.16) luc(t)|lne SK), tel, 
independent of < € (0, 1]. 


Proof. Using the Sobolev imbedding theorem, we can dominate the right side of 
(7.15) by E(||ue(t)||F,c), so ||we(t)||F-¢ = y(t) satisfies the differential inequality 


(7.17) = < Ely), y(0) = || fllze- 


Gronwall’s inequality then yields a function K(t), finite on some interval 
I = (0, A), giving an upper bound for all y(t) satisfying (7.17). This J and 
K(t) work for (7.16). 


We are now prepared to establish the following existence result: 
Theorem 7.2. [f (7.1) satisfies the parabolicity hypothesis (7.2), and if f € 


H*(M), with € > n/2 + 2, then there is a solution u, on an interval I = [0,T), 
such that 


(7.18) u€ L©(I,H"(M)) 9 Lip(I, H*-?(M)). 
Proof. Take the J above and shrink it slightly. The bounded family 
u, € C,H) nc'd, At”) 


will have a weak limit point u satisfying (7.18). Furthermore, by Ascoli’s theorem 
(in the form given in Exercise 5, in § 6 of Appendix A), there is a sequence 


(7.19) Ue, —>u inC(I, H*?(M)), 


since the inclusion H’ + H‘~? is compact. In addition, interpolation inequalities 
imply that {uz : 0 < € < 1} is bounded in C? (I, H‘~??(M)) for each o € 
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(0,1). Since the inclusion H’~2” <> C?(M) is compact for small o > 0 if 
> n/2-+ 2, we can arrange that 


(7.20) Ue, —>u inC(I,C?(M)). 
Consequently, with e = €,, 


Je S- AJ* (t,x, Di Jette) 0; On JeWe —> S- AJ¥(t, x, Diu) 0; One, 


(7.21) 
J-B(t, 2, DiJ,u,) —> B(t, 2, Diu) 

in C(I x M), while clearly Ou., /Ot + Ou/Ot weakly. Thus (7.1) follows in the 

limit from (7.3), and the theorem is proved. 


We turn now to questions of the uniqueness, stability, and rate of convergence 
of ue to u; we can treat these questions simultaneously. Thus, with e € [0,1], we 
compare a solution u to (7.1) with a solution u- to 


Jue 
= jk 
(7.22) a =e YA ta, De Jez) 0;0nJet. + IBGE, 0, Jets), 
u-(0) =h. 


For brevity, we suppress the (¢, 2)-dependence and write 


ot = £(Dlu, Du+ B(Du), 
(7.23) Be 
AE = J1(D Ii, 2 date + IBD I), 


Let v = u — ue. Subtracting the two equations in (7.23), we have 


Ov 


(7:24) a L(Diu, D)v + L(Diu, D)ue — JeL( Di Jeug, D) Jette 


+ B(D)u) — JeB(Di Jette). 
Write 
L(Diu, D)ue — JeL( Di Jette, D) Jette 
(7.25) = [L(Dju, D) — L(Djue, D)] ue + (1 — Je) L(Djue, D)ue 
+J-L(Djue,D)(1 — Je)ue + Je |L(Djue, D) — L(D} Jette, D)| Jeue 
and 


B(Dju) _ JB DJ Ms) = [B (D; pu) — B(Dzuz)| 
(7.26) (b= 4.) BD i) 
+ J-[B(Djue) — B(DzJete)]- 
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Now write 


B(Diu) — B(Diw) = G(Dju, Diw)(Diu— Diw), 


(7.27) 1 
G(Dj;u, Dw) = i B'(rDyu + (1 — 7) Dw) dr, 
0 


and similarly 
(7.28)  L(Dju, D) — L(Diw, D) = (Diu — Diw)- M(Dju, Diw, D). 


Then (7.24) yields 


(7.29) a = L(Dju, D)v + A(D}u, D2ue)Div + Re, 
where 

A(Dju, D2uz) Div 
(7.30) 


= Diu- M(Dtu, Diue, D)ue + G(Dzu, Diue) Div 


incorporates the first terms on the right sides of (7.25) and (7.26), and R, is the 
sum of the rest of the terms in (7.25) and (7.26). Note that each term making up 
R- has as a factor J — J-, acting on either Diuz, B(Diuz), or L(Dihuc, D)ue. 
Thus there is an estimate 


(7.31) |Re(t)\lz2 < Ce(llue(t)|lo2) (1+ |lue() [I Fxe) re(€)?, 
where 
(7.32) re(e) = ||Z — Jel cce-2,22) © ||I — Jell cure): 


Now, estimating (d/dt)||v(t)||%2 via techniques parallel to those used for 
(7.4)-(7.15) yields 


(7.33) Sloot: < C§HlluOlize + SO, 

with 

(7.34) C(t) = C(|lue(Mlle2 luMllez), SO = [|ReOllze- 
C(r) dr, 


Consequently, by Gronwall’s inequality, with A(t) = tis 


(7.35) | 


t 
v(t)[B2 < ek (i ile +f seek ir) | 
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for t € [0, 7’). Thus we have 


Proposition 7.3. For £ > n/2+ 2, solutions to (7.1) satisfying (7.18) are unique. 
They are limits of soutions uz to (7.3), and, fort € I, 


(7.36) Ilu(t) — we) IIz2 S Ka) — Sell ccae-2,12)- 


Note that if Je = y(eW—A) and » € S(R) satisfies p(X) = 1 for |A| < 1, we 
have the operator norm estimate 


(7.37) IZ — Jellccae-2,12) < Ce*?. 


We next establish smoothness of the solution u given by Theorem 7.2, away 
from t = 0. 


Proposition 7.4. The solution u of Theorem 7.2 has the property that 
(7.38) ueC~((0,T) x M). 


Proof. Fix any S < T and take J = (0,5). If we integrate (7.14) over J, we 
obtain a bound on J; || J-ue(#)||7,c+1 dt, provided we assume £ > n/2+ 2, so that 
we can appeal to a bound on O(||.J-we(t)|c2) and on || J-ue(t)||7,., fort € J. 
Thus 


(7.39) u € L?(J,H"t!(M)). 


Recall that we know u € Lip(I, H‘~?(M)). It follows that there is a subset € of 
I such that 


(7.40) meas(€)=0, tp €1\€ => u(to) € H**1(M). 
Given tp € I \ E, consider the initial-value problem 


OU 


(41) a= 


S¢ AP*(t, a, DZU) O;O,U + B(t,2,D,U), U(to) = u(to). 
By the uniqueness result of Proposition 7.3, U(t) = u(t) forto <t < T. 

Now, the proof of Theorem 7.2 gives a length L > 0, independent of to € J, 
such that the approximation U, defined by the obvious analogue of (7.3) con- 
verges to U weakly in L®([to, to + L],H*(M)). In particular, ||U.(t)||c2 is 
bounded on [t¢o, to + L]. On the other hand, there is also an analogue of (7.15), 
with @ replaced by @ + 1: 


d 
(7.42) qllUeO liver: < Cena (Ue @lle2) (Ve llirers + 1). 
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Consequently, U- is bounded in C([to, to + L}, H‘*1(M)), and we obtain 
(7.43) we L™({to,to + L],H“t'(M)) NM Lip([to, to + L], H**(M)). 
Since the exceptional set € has measure 0, this is enough to guarantee that 
(7.44) u € LS (J,H*!(M)) NM Lip,,.(J, H*1(M)), 


and since J is obtained by shrinking J as little as one likes, we have 
(7.44) with J replaced by J. Now we can iterate this argument, obtaining 
u € L& (I, H*4(M)) for each j € Z*, from which (7.38) is easily deduced. 


loc 
We can now sharpen the description (7.18) of the solution u in another fashion: 
Proposition 7.5. The solution u of Theorem 7.2 has the property that 
(7.45) ué€ C(I, H*(M)) NC" (I, H*-?(M)). 
Proof. It suffices to show that u(t) is continuous at t = 0, with values in H‘(M). 
We know that as t \, 0, u(t) is bounded in H(M) and converges to u(0) = f 


in H‘~?(M); hence u(t) + f weakly in H“ as t \, 0. To deduce that u(t) > f 
in H-norm, it suffices to show that 


(7.46) limsup ||u(t)|lze < || fllaze- 
t\o 


However, the bounds on ||w-(t)|| 4 implied by (7.15) easily yield this result. 


Now that we have smoothness, (7.38), an argument parallel to but a bit simpler 
than that used to produce (7.4)-(7.15) gives 


(7.47) Sut) < Ceo(llu(lle2) (Ilu(t) lire + 1), 


for a solution u € C%((0,7) x M) to (7.1). This implies the following persis- 
tence result: 


Proposition 7.6. Suppose u € C%((0,T) x M) is a solution to (7.1). Assume 
also that 


(7.48) llu(t)Iloz < K < 0, 


fort € (0,7). Then there exists T; > T such that u extends to a solution to (7.1), 
belonging to C® ((0,T1) x M). 
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A special case of (7.1) is the class of systems of the form 


(7.49) ou S- Al*(t,a,u) 0j0pu+ B(t,a,Dzu), u(0) =f. 
j,k 


3 2 


We retain the strong parabolicity hypothesis (7.2). In this case, when one does 
estimates of the form (7.5)-(7.15), and so forth, C?-norms can be systematically 
replaced by C!-norms. In particular, for a local smooth solution to (7.49), we have 
the following improvement of (7.47): 


d 
(7.50) qle@llire < Celle lle) (Ie@)llire + 1)- 
Thus we have the following: 


Proposition 7.7. If (7.49) is strongly parabolic and f € H*(M) with € > 
n/2 +1, then there is a solution u, on an interval I = |0,T), such that 


(7.51) u € C([0,T), H*(M)) NC™((0,T) x M). 
Furthermore, if 
(7.52) \|u(t)|\|or < K <0, 


fort € (0,7), then there exists T, > T such that u extends to a solution to (7.49), 
belonging to C® ((0,T,) x M). 


We apply this to obtain a global existence result for a scalar parabolic equation, 
in one space variable, of the form 


Ou 


(7.53) Bt 


= A(u)O2ut+g(u,ux), u(0) =f. 


Take M = S*. We assume g(u, p) is smooth in its arguments. We will exploit the 
maximum principle to obtain a bound on w,, which satisfies the equation 


a / 
(7.54) ot A(u) 02(uz) + A’(u)ue Ov (ux) 


+ Gp(u, Ux) Or (Ux) + Gulu, Uz )Ug- 


The only restriction on applying the maximum principle to estimate |u,.| is that 
we need g,,(u, Uz) < 0. We can fix this by considering e~'* u,., which satisfies 


0 


(735) ys (e 


Bip) = é (kh — Kuz), 
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where R is the right side of (7.54). The maximum principle yields 


(7.56) IIuelloo Se * Oxf loo, 
provided 
(7.57) Gu(U, Ur) < K. 


We have the following result: 


Proposition 7.8. Given f € H?(S*), suppose you have a global a priori bound 
(7.58) Ilu(t) ll < Ko, 


for a solution to the scalar equation (7.53). If there is also an a priori bound 
(7.57) (for |u| < Ko), which follows automatically in case g = g(u) is a smooth 
function of u alone, then (7.53) has a solution for all t € [0, 00). 


The class of equations described by (7.53) includes those of the form 


(7.59) a = 0,(A(u) 0,u) + y(u), u(0) = f. 


In fact, this is of the form (7.53), with 


(7.60) g(u, Ue) = A'(u)uz + p(w). 
Thus 
(7.61) Gulu, Uz) = A” (u)u2 + y'(u). 


In such a case, (7.57) applies if and only if A’’(u) < 0, that is, A(w) is concave 
in u. For example, Proposition 7.8 applies to the equation 


Ou 
(7.62) a Ap(ud,u) + (u), u(0) = f, 
in cases where it can be shown that, for some a,b € (0,00), a < u(t, x) < 6b for 
allt > 0, x € S. This in turn holds if f(a) takes values in the interval [a, b] with 
y(a) > 0 and y(b) < 0, by arguments similar to the proof of Proposition 4.3. A 
specific example is the equation 


ju afa 
(7.63) a == (ux) tu(l—u), u(0)=f, 


arising in models of population growth (see [Grin], p. 224, or [Mur], p. 289). This 
is similar to reaction-diffusion equations studied in § 4, but this time there is a 
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nonlinear diffusion as well as a nonlinear reaction. In this case we see that (7.63) 
has a global solution, given smooth f with values in a interval J = [a,b], with 
a> 0; u(t, x) € I for all (t,2) € R* x S' if also b > 1. This existence result is 
rather special; a much larger class of global existence results will be established 
in § 9. 


Exercises 


1. In the setting of Proposition 7.3, given u(0) = f € H‘(M), initial data for (7.1), work 
out estimates for 
I|u(t) — ue()llas(my, LS ISl-1. 


2. Establish global solvability on [0,00) x $* for 
(7.64) ap = Au) aut ou), u(0) =f, 


given f € H?(S"), under the hypotheses 
a<f(x)<b, vla)20, lb) <9, 


and 
A(u) >C>0, fora<u<ob. 


3. Establish global solvability on [0,00) x $* for 


(7.65) Ot _ A(us)ues, (0) = f, 
ot 
given f € H?(S'), under the hypothesis 
(7.66) A(p) > C > Oand A”(p) < 0, for |p| < sup |f’(2x)|. 


(Hint: The function v = uz satisfies 


Ov 
BY = a,(A(v) dev), 0(0) =F") 
Estimate vz = Ug2.) 
Consider the example 
Ou 2\1/2 
(7.67) a (l=) "tea, 
assuming | f’(a)| < a < 1, or the example 
Ou 21 
7. —=(1 LL» 
(7.68) BE (l+uz) ou 


assuming | f’(ax)| <b < \/1/3. 
A much more general global existence result is derived in § 9. See Exercise 3 of § 9 for 
a better existence result for (7.68). 
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8. Quasi-linear parabolic equations II (sharper estimates) 


While most of the analysis in § 7 was fairly straightforward, the results are not as 
sharp as they can be, and we obtain sharper results here, making use of paradif- 
ferential operator calculus. The improvements obtained here will be coupled with 
Nash—Moser estimates and applied to global existence results in the next section. 
Most of the material of this section follows the exposition in [Tay]. 

Though we intend to concentrate on the quasi-linear case, we begin with com- 
pletely nonlinear equations: 


0 
(8.1) 5p =F (be, Diu), u(0) =f, 


for u taking values in R“. We suppose F = F(t, x,¢), € = (Ca; : Jal < 2,1 < 
9 < K) is smooth in its arguments, and our strong parabolicity hypothesis is 


(8.2) -Re )) (OF /Oa)E* = ClEl?I, 


ja|=2 


for € € R”, where Re A = (1/2)(A+ A*), fora K x K matrix A. Using the 
paradifferential operator calculus developed in Chap. 13, § 10, we write 


(8.3) F(t, x, D2v) = M(v;t,2, D)v + R(v). 
By Proposition 10.7 of Chap. 13, we have, for r > 0, 
(8.4) u(t) € C77 => M(u;t,2,£) € AhS7, C C™S7N Shy, 


where the symbol class Aj.S7"; is defined by (10.31) of Chap. 13. The hypothesis 
(8.2) implies 


(8.5) —Re M(v;t, x, &) > Cl€|?I > 0, 

for |€| large. Note that symbol smoothing in «, as in (9.27) of Chap. 13, gives 
(8.6) M(v;t,x,€) = M*(t,x,€)+ M°(t,z, &), 

and when (8.4) holds (for fixed 1), 

(8.7) M* (t,2,€) € AjS?5, M(t,2,8€) € S77”. 

We also have 

(8.8) —Re M* (t,x, €) > Cl€|?I > 0, 


for |€| large. 
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We will obtain a solution to (8.1) as a limit of solutions u, to 


(8.9) it = J-F(t, ay) dete), ue(0) = f. 


Thus we need to show that u-(t, x) exists on an interval ¢ € [0, T’) independent of 
€ € (0, 1] and has a limit as « — 0 solving (8.1). As before, all this follows from 
an estimate on the H*-norm, and we begin with 


d 

10) glltue(Ili2 = 2M Jet, 2, D3 Jette), Mt) 
= 2(A° MeJeuc, A’ Jeue) + 2(A° Re, A’ Sete). 

The last term is easily bounded by 


C(lue(t)| 22) [Il Jewe(t) lls + 1). 
Here M. = M(J-u.;t, x, D). Writing M. = M# + M? as in (8.6), we see that 


(ASM? Jee, AS Jette) 
(8.11) (AP ME Tie A Lae) 
< C([Jete||c2tr) [Fete] gs+1-rs || Fete || s+, 


for s > 1, since by (8.7), M? : H*+1-"§ —+ H*—1. We next estimate 


(A°*M# Jette, AM Jue) 


(8.12) 
= (M#A8 Jeuc, AS Jet) + ([A°, MP] Jeu, Ao Jette). 


By (8.7), plus (10.99) of Chap. 13, we have [A*%, Mz*] € Ors.” fO<r<l, 
so the last term in (8.12) is bounded by 


CATE MF J tie KO Tae) 


(8.13) 
< C([|uello2+~) 


| JeUe|| stir + || Jette || ze+1. 
Finally, Garding’s inequality (Theorem 6.1 of Chap. 7) applies to M?: 
(8.14) (MFw, w) < —Collullin + Cr ([luellor+) ellz2- 


Putting together the previous estimates, we obtain 


d 1 
(8.15) Flluc(t)lli« < —ZCol|Jevellize+s + C(||Uellor++)|[Jetelljret—rs, 
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and using Poincaré’s inequality, we can replace —Co/2 by —Co/4 and the 
H**1-'° norm by the H®-norm, getting 


d 1 
“lite Ct) libre S — ZCol Jere (€)Il3re 


+ C"(|[ue(#)\lo2+r) | Fete (t) [lire 


(8.16) 


From here, the arguments used to establish Theorem 7.2 through Proposition 7.6 
yield the following result: 


Proposition 8.1. [f(8.1) is strongly parabolic and f © H*(M), with s > n/2+2, 
then there is a unique solution 


(8.17) u € O((0,T), H°(M)) NC™((0,T) x M), 
which persists as long as ||u(t)||c2+r is bounded, given r > 0. 


Note that if the method of quasi-linearization were applied to (8.1) in concert 
with the results of §7, we would require s > n/2 + 3 and for persistence of the 
solution would need a bound on ||u(¢)||c3. 

We now specialize to the quasi-linear case (7.1), that is, 


(8.18) 


= 0 A*(t,2, Diu) 0;0,u+ Bit,x, Diu), u(0) = f. 
ik 


This is the special case of (8.1) in which 

(8.19) F(t,x, D2u) = S~ AM* (t,x, Diu) 0;0.u + B(t, ©, Diu). 

We form M(v; t,x, D) as before, by (8.3). In this case, we can replace (8.4) by 
(8.20) v EC" —> M(v;t,2,€) € ApSi1 + Sia 


Thus we can produce a decomposition (8.6) such that (8.7) holds for v € C!+", 
Hence the estimates (8.11)-(8.16) all hold with constants depending on the 
C'*"-norm of uz(t), rather than the C?*”-norm, and we have the following 
improvement of Theorem 7.2 and Proposition 7.6: 


Proposition 8.2. If the quasi-linear system (8.18) is strongly parabolic and f € 
H*(M), s > n/2 + 1, then there is a unique solution satisfying (8.17), which 
persists as long as ||u(t)||c1+r is bounded, given r > 0. 


We look at the parabolic equation 


(8.21) a = S > APF (t,x, u) 0;O,u+ B(t,x,u), 
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which is a special case of (8.1), with 

(8.22) F(t, 2, Dau) = 5 > AP* (t,x, u) 0;0,u+ B(t, 2, u). 

In this case, if r > 0, we have 

(8.23) v€C" => M(v;t, 2,6) € Aj S71 + Sia 3 

and the following results: 

Proposition 8.3. Assume the system (8.21) is strongly parabolic. If f © H*(M), 


s > n/2 +1, then there is a unique solution satisfying (8.17), which persists as 
long as ||u(t)||cr is bounded, given r > 0. 


It is also of interest to consider the case 
(8.24) ei Sa; AM*(t,2,u) Ou, u(0) = f. 
Ot 9] es) ’ 
Arguments similar to those done above yield the following. 
Proposition 8.4. If the system (8.24) is strongly parabolic, and if f © H*(M), 
s > n/2 +1, then there is a unique solution to (8.24), satisfying (8.17), which 


persists as long as ||u(t)||cr is bounded, for some r > 0. 


We continue to study the quasi-linear system (8.18), but we replace the strong 
parabolicity hypothesis (7.2) with the following more general hypothesis on 


(8.25) La(t,v, 2,8) =— > A* (t,x, v)esen; 
jk 
namely, 
(8.26) spec Lo(t,v,2,€) C{z €C: Rez < —Colé|*}, 


for some Cg > 0. When this holds, we say that the system (8.18) is Petrowski- 
parabolic. Again we will try to produce the solution to (8.18) as a limit of 
solutions w- to (8.9). In order to get estimates, we construct a symmetrizer. 


Lemma 8.5. Given (8.26), there exists Po(t,v, x, €), smooth in its arguments, for 
€ # 0, homogeneous of degree 0 in €, positive-definite (i.e., Py > cl > 0), such 
that —(PoL2 + L3 Pp) is also positive-definite, that is, 


(8.27) —(PoL2 + L3Po) > Clé/?I > 0. 
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Such a construction is done in Chap.5. We briefly recall the argument used 
there, where this result is stated as Lemma 11.5. The symmetrizer Po, which is 
not unique, is constructed by establishing first that if D2 is a fixed K x K matrix 
with spectrum in Re z < 0, then there exists a K x K matrix Po such that Po 
and —(P)L2 + L4 Po) are positive-definite. This is an exercise in linear algebra. 
One then observes the following facts. One, for a given positive matrix Po, the 
set of Lz such that —(PoL2 + L4 Po) is positive-definite is open. Next, for given 
Lz with spectrum in Re z < 0, the set {Py : Py > 0,—(PoL2 + L3Py) > 0} 
is an open convex set of matrices, within the linear space of self adjoint K x K 
matrices. Using this and a partition-of-unity argument, one can establish the fol- 
lowing, which then yields Lemma 8.5. (Compare with Lemma 11.4 in Chap. 5. 
Also compare with the construction in § 8 of Chap. 15.) 


Lemma 8.6. If Mj, denotes the space of real K x K matrices with spectrum in 
Re z < Oand Pe the space of positive-definite (complex) KX x IK matrices, there 
is a smooth map 

®: My — Pk, 


homogeneous of degree 0, such that if L € Mj, and P = ®(L), then —(PL + 
L*P) € PE. 


Having constructed Po(t,v, x, &), note that, for fixed t, r € R, 


(8.28) ue CO" => L(t, Dru,2,€) © CLS% and 
Po(t, Dzu, x, €) € CTS%. 

Now apply symbol smoothing in x to Po(t, o£) = Pot, Diu, x, €), to obtain 
(8.29) P(t) € OPAGS? 5; P(t) — P(t, Diu, x, D) € OPCS. 
Then set 


(8.30) Q= S(P+P*)+ KA, 


with K > 0 chosen so that Q is positive-definite on L?. Now, with u, defined as 
the solution to (8.9), u-(0) = f, we estimate 


: (A*ue, Q-A*ue) = 2(A*O;ue, Q-A*ue) ot (APU, PAP ie); 


(8.31) er 


where P. is obtained as in (8.29)-(8.30), from symbol smoothing of the family 
of operators P: = Po(t, Di J-u-,x,D), and Q- comes from P- via (8.30). Note 
that if u_(t) is bounded in C1*"(M), then P!(t) is bounded in OPS?;"(M), so 


(8.32) (AP ue, PLAS ue)| < C(|lue() lor) [lue(t) Fet1—r2- 
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We can write the first term on the right side of (8.31) as twice 
(8.33) (QeASJeMz Jette, A®te) + (QeA' Re, A°te), 
where M, is as in (8.10). The last term here is easily dominated by 
(8.34) C((fere(t)||o») || ete (t) || rot + lltue(t) lee. 
We write the first term in (8.33) as 


(Q-M.M Jue, Af Jeue) + (Q- [A*, M_| Jee, AP JU) 


8.35 
+ ((Q-A*, Je|MeJeuc, Mute). 


We have Q.(t) € OPAGS) 5, by (10.100) of Chap. 13, and hence, by (10.99) of 
Chap. 13, 


(8.36) [Q-A‘, J-] bounded in £(H*~", L), 
with a bound given in terms of ||w-||c1+- if r > 1. Furthermore, we have 
(8.37) || Me Jete|| 2-1 S< C([|Uellor+r) [Jet lle, 
so we can dominate the last term in (8.35) by 
(8.38) C(Ilue (t) lore) || Jette] zrs+1 - |[Uellazs, 
provided r > 1. Moving to the second term in (8.35), since 
M, € OPA§S7, + OPST7", 
we have 


(8.39) I|[A*, MeJu||z2 < C(|\uel|cr+r) 


|u|| ast, 


provided r > 1. Hence the second term in (8.35) is also bounded by (8.38). 
This brings us to the first term in (8.35), and for this we apply the Garding 
inequality to the main term arising from M. = M# + M?, to get 


(8.40) (Q-M.v,v) < —Collullza + C(|luellc2)llollZ2- 


Substituting v = A*J-u, and using the other estimates on terms from (8.31), we 
have 


d 
—(A% AS <_ 2 me 
(8.41) at Ue Qe Uz) = Co|| Jette lz + 


+ C(||uellca+5)||Uell xe [|| Jettel] zro42 + [luc lla] 
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which we can further dominate as in (8.16). Note that (8.32) is the worst term; we 
need r > 2 for it to be useful. 

From here, all the other arguments yielding Propositions 8.1 and 8.2 apply, and 
we have the following: 


Proposition 8.7. Given the Petrowski-parabolicity hypothesis (8.25)-(8.26), if 
f € H°(M) and s > n/2 +3, then (8.18) has a unique solution 


(8.42) ué€ C((0,T), H°(M)) A C™((0,T) x M), 

for some T > 0, which persists as long as ||u(t)||c1+r is bounded, for some r > 0. 
In order to check the persistence result, we run through (8.31)-(8.41) with 

ue replaced by the solution u, and with J, replaced by J. In such a case, the 

analogue of (8.32) is useful for any r > 0. The analogue of (8.36) is vacuous, 

so (8.38) works for any r > 0. An analogue of (8.11)-(8.13) can be applied to 


(8.39); recalling that this time we have (8.7) for u € C!+" we also obtain a useful 
estimate whenever r > 0. This gives the persistence result stated above. 


Exercises 


In Exercises 1-10, we look at the system 


a = MAu-—aV- (uVv), 
ot 

(8.43) 
Ov _ Barwa bu n 
Ot "uth oe 


We assume that M, D, ju, a, b, and h are positive constants, and A is the Laplace oper- 
ator on a compact Riemannian manifold. This arises in a model of chemotaxis, the 
attraction of cells to a chemical stimulus. Here, u = u(t, x) represents the concen- 
tration of cells, and v = u(t, x) the concentration of a certain chemical (see [Grin], 
p. 194, or [Mur]). 

1. Show that (8.43) is a Petrowski-parabolic system. 

2. If (u,v) is a sufficiently smooth solution for t € [0, 7), show that 


(8.44) u(0) > 0, v(0) > 0 = u(t) > 0, v(t) >0, Vte (0,7). 


(Hint: If we can deduce u(t) > 0, the result follows for 


t 
u(t) = e(PA-Hty(Q) +f e(PA-W(t-7) (u(r) dr, y(u)= bu ; 
0 uth 


Temporarily strengthen the hypothesis on u to u(0, 2) > 0, and modify the first equa- 
tion in (8.43) to 
ut = MAu — aV- (uVv) +¢, 


with small « > 0. Show that u(t, x) > 0 for ¢ in the interval of existence by consid- 
ering the first to at which, for some zo € M, u(to, 20) = 0. Derive the contradictory 
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estimate O;u(to, 20) > €. To pass from the modified problem to the original, you may 
find it necessary to work Exercises 3-10 for the modified problem, which will involve 
no extra work.) 

3. Show that ||u(t)||z1(az) is constant, for ¢ € [0, T). (Hint: Integrate the first equation 
in (8.43) over x € M, and use the positivity of wu.) 
Note: The desired conclusion is slightly different for the modified problem. 

4. Given the conclusion of (8.44), show that, for J = [e,T), r € (0,1), 


(8.45) sup ||v(€)||c1tr(ay < 00. 
tel 


(Hint: Regard the second equation in (8.43) as a nonhomogeneous linear equation for 
v, with nonhomogeneous term F(t, x) = bu/(u+h) € L° (I x M).) 
5. Show that, for any 6 > 0, 


(8.46) ap \|u(t) ll z1-8.1 ar) < 00. 
te 


(Hint: Regard the first equation in (8.43) as a nonhomogeneous linear equation for 
u, with nonhomogeneous term G(t,z) = aV- H(t,x), where H = uVvu € 
L™ (I, L'(M)).) 

6. Given (8.45)-(8.46), deduce that H € L©(I,H"™'(M)), for any r € (0,1). Hence 
improve (8.46) to 


(8.47) sup ||u(t)||172-6.1 (a1) < 09, 
tel 


for any 5 > 0. Consequently, for p € (1,n/(n — 1)), 


(8.48) sup Iu (t) ll z1.2(ar) < 00. 
te 


7. Now deduce that H € L®(I,H™?(M)), for any r € (0,1), p € (1,n/(n — 1)). 
Hence improve (8.48) to 


(8.49) sup ||u(t)||z2-s.e(ar) < 00. 
tel 
8. Iterate the argument above, to establish (8.49) for all p < oo, hence 


(8.50) sup |lu()[lor+r(as) < 00, 
te 


for any r < 1. 
9. Using (8.50), improve the estimate (8.45) to 


sup ||v(t)||os+r(my) < ©, 
tel 
for any r < 1. Then improve (8.50) to sup; ||u(t)||c2+r(a) < 00, and then to 
sup ||u(t)||o3+r(m) < 00. 
tel 


10. Now deduce the solvability of (8.43) for all t > 0, given (8.44). 
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In Exercises 11-12, we look at a strongly parabolic K x K system of the form 
(8.51) ut = A(u)tee +g(u,ux), «E S’. 
11. Ifu€ C™((0,T) x M) solves (8.51), show that 


{ ae()Il2a¢s%) < —Co|luex(t)||Z2 + 2] 9(u(t), ue(t)) || p2lluen(t)Iln2 


x 


(8.52) 


x 


<P use (lie + Gllalu(t). te) Mize 


where 2A(w) > Co > 0. 
12. Suppose you can establish that the solution u possesses the following property: For 
eacht € (0,7), ||u(t,-)||z~ < C1 < 00. Suppose 


(8.53) lg(v, p)| < C2(1 + |p), 
for |v| < C1. Show that u extends to a solution u € C™((0, 71) x M) of (8.51), for 
some 7; > T’. (Hint: Use Proposition 8.3.) 
In Exercises 13-15, we look at a strongly parabolic kK x K system of the form 


(8.54) - = 0,A(u)O2u+ f(u), «ES. 


13. Ifu € C™((0,T) x M) solves (8.54), show that 


d 


a lux (t)IlZ2(s1) < (Bllu(E)|lz~° — Co)||uee(t)|lz2 


(8.55) 
+ 2l|f'(u(t)) || poo llue()Ilz2, 


where 
2A(u) >Co>0, 3 =sup 6||DA(u)]). 


(Hint: Use the estimate 
|luellZ4 < 3llulle~[|Ozullz2, 


which follows from the p = 1, k = 2 case of Proposition 3.1 in Chap. 13.) 
14. Improve the estimate (8.55) to 


d , 
(8.56) alle lize < (BN (u) — Co) ||ueallZ2 + 2I| f’(w)|| 2 lluellZ2, 
where 
. 1 
(8.57) N (9) = inf lg — Allzco(st) = 9 0809: 


15. Suppose you can establish that the solution u possesses the following property: 
For each t € (0,7), u(t,-) takes values in a region K, C R™ so small that 
N(u(t)) < Co/@. Assume |lv|| < Ci < ov, for v € Ky. Show that u extends 
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to a solution u € C® ((0, Ti) x M) of (8.54), for some T, > T. 
Compare with the treatment of (7.59). See also the treatment of (9.61). 
16. Rework Exercise 12, weakening the hypothesis (8.53) to 


1 
(8.58) |9(v, p)| < Ca(1 + |p|) +aCo|p?, a< om 


for |u| < C1. 


9. Quasi-linear parabolic equations III 
(Nash—Moser estimates) 


We will be able to get global solutions to a certain class of quasi-linear parabolic 
equations by applying the results of § 8 together with Hélder estimates for solu- 
tions to scalar equations of the form 


(9.1) —--Lu=0, Lu=b"' 5° 60;(a'*b O,u), 
jk 


where a?*,b,b~! € L®. The operator L is as in (9.1) of Chap. 14, and we make 
the same ellipticity hypothesis as used there; thus we assume 


(9.2) do 9 GS So a(t, abbr SSE, bo S d(z) Sh, 
with 
(9.3) O< Ap <A <0, O< bo <b < Ow. 


We take b independent of t. Hélder estimates for solutions to (9.1) under these 
hypotheses were first proved by Nash [Na]. Moser [Mos2] established a Harnack 
inequality that yielded such Holder estimates; a simpler proof is given in [Mos3]. 
Another treatment of Nash’s results has been given in [FS]. All these arguments 
are more elaborate than that used for elliptic equations in Chap. 14, partly because 
they produce a sharper sort of Harnack inequality. Here, we follow [Kru], who 
obtained a parabolic analogue of the weaker Harnack inequality discussed in 
Chap. 14, by methods parallel to those in Moser’s first treatment of the elliptic 
case, in [Mos1]. 

As in § 9 of Chap. 14, which we will refer to as “14” for short, we use a?* to 
define an inner product of vectors in R”: 


(9.4) (V,W) = So Vj07*Wi,; 
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we use the square norm |V|? = (V,V); and we use b dx = dV to define the 
volume element. Parallel to (9.4) of “14,” we have 


(9.5) v= f(u) => (@—L)v= f'(u)(Q, — L)u— f"(u)|Vul?. 


We say v is a subsolution of 0; — L provided (0, — L)v < 0. Thus we see that 
ut» f(u) takes solutions to (0; — L)u = 0 to subsolutions if f is convex, while 
it takes subsolutions to subsolutions if f is both convex and increasing. 

Next, parallel to (9.3) of “14,” we have 


(9.6) J [wo — L)udt dV = [f (v2, Vou) dt dV + - wo;u dt dV, 
Q Q Q 


where Q = I x 0 = [T,, T] x Q and w vanishes near J x OQ. If we set w = 
wu, where w(t, x) is C°° and vanishes for x near OQ, we obtain the following 
analogue of (9.5) of “14”: 


i w?|V,ul? dt dV 
(9.7) = -2 ff wan, uV zw) dt dV +f wgu dt dV 
+ [flaw yw dt dV — 5 feet, a) dV + 5 f eu(ti.a) dV, 


provided (0, — L)u = g. Consequently, parallel to the estimate (9.6) of “14,” we 
have 


5 ff vveuP dt dV + 5 [Wut 2)? dV 
(9.8) < 2 ff w([Veol? a 5a?) dt dV 
+ [f Pou dt dV + 5 [ u(t, 2y dV. 


We now proceed to a Moser iteration argument, parallel to (9.7)—(9.20) of “14.” 
Given @ = I x Q), consider nested sequences of regions 2 = Q9 D ++» D 0; D 
Qj41 D --- in R” and intervals [ = I9 D +--+. D J; D Ij41 D ---, with 
intersections © and J, respectively, so we have Q; = Ij x Q3 \ Q =JxO 
(see Fig. 9.1). Let us assume J = [0,T] and I; = [7,7], with J = [1'/2,T]. 
We suppose that the distance of any point in 00; +1 to OQ; is ~ j~* and that the 
length of I; \ [j41 is ~ j~*. We want to estimate the sup norm of a function v on 


Q in terms of its L?-norm on Q, assuming 


(9.9) v>0 and (0, —L)v <0. 
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FIGURE 9.1 Setup for Moser Iteration 


In view of (9.5), an example is 
(9.10) v= (1402), [Tu=0. 


We will obtain such an estimate in terms of certain Sobolev constants, y(Q,) and 
C;, arising in the following two lemmas, which are analogous to Lemmas 9.1 and 
9.2 of “14.” 


Lemma 9.1. For sufficiently regular v defined on Q,, and with & < n/(n — 2), 
we have 


Io" Bacay) $ 123) (05(0)"*IVal89,) + 25(0)"); 


(9.11) 
o;(v) = sup ||v(t)||Z2(a,)- 
tel; 


Proof. This is a consequence of the following slightly sharper form of (9.10) 
in “14”: 


K 2(K-1 K 
9.12) [Bacay £125) (lIVerllaca,yllollza> + leoll2(a,))- 


Indeed, integrating (9.12) over t € I; gives (9.11). 
Next, we have 


Lemma 9.2. [fv > 0 is a subsolution of 0, — L, then, 


(9.13) Vc] 22(Q541) + oe lo) 22 54.1) < C3\lvllz2(@,); 
j+1 


where C; = C(Q;,Q541). 


Proof. This follows from (9.8), with u = v, if we let T; = 7;, pick » = 
p;(x)n,;(t), with y; (2) = 0 for x near 00,;, while yj (x) = 1 for x € 0541, 
and 7;(7;) = 0, while ;(t) = 1 for t € I;41. Then let T> run over [7)41, T]. 
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We construct the functions y,; and 7; to go from 0 to 1 roughly linearly, over a 
layer of width ~ Cj~?. As in (9.12) of “14,” we can arrange that 


(9.14) V(Q5) <0, Ci < Coj? +1). 
Putting together these two lemmas, we see that when v satisfies (9.9), 


(9.15) lo E2(Q541) S 0(CF* + Yilell73.a,)- 


Now, if v satisfies (9.8), so does v; = ye’, by (9.5). Note that vj.1 = UF - From 
here, the estimate on 


F 1/KI 
(9.16) lel.) S Himsup [lel 209,) 
jmoo 


goes precisely like the estimates on (9.16) in “14,” so we have the sup-norm 
estimate: 


Theorem 9.3. [fv > 0 is a subsolution of 0, — L, then 
(9.17) lollg=@) < Kellan 


where K = K(74,Co,). 


Next we prepare to establish a Harnack inequality. Parallel to (9.24) of “14,” 
we take w = w? f’(u) in (9.6), to get 


[fer woiveur dt av + f vs (u(ds,2)) dV 
(9.18) = =2 ff (oF! (w) Vou, Vor) dt dV +2 ff wav) fu) dt dV 
+ f PF(u(T,2)) dV 


if (O, — L)u = 0 on [T1, To] x Q, and w(t, x) = 0 for x near OQ. If we pick f(u) 
to satisfy the differential inequality 


(9.19) fl (u) > f'(uy?, 
we have from (9.18) that 
5 ff ver w)lVaul? at av + fv? F(u(Tp,2)) av 
(9.20) < 2 ff |Vi~|? dt av +4 f wuz f(u) dt dV 
+ [vs (u(t,2) av, 
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provided (0; — L)u = 0 on [T,, To] x 0. Since f”(u)|Veul? > f’(u)?|Veul? = 
|V..v|?, we have 


5 ff vlveur dt av + [ vo(Ta,2) dV < 
2// |Vevl2 dtdV +4 | | ddyv dt dV + | W?v(Ty, 2x) av. 
I I : 


If we take w(t, x) = v(x) € CG°(Q), we have 


(9.21) 


1 

5 | [evel at av + [ g*o(Te,2) dV 
(9.22) 

< 2(T> = T1) / lV2v|" dV + / y*u(Th, x) dv. 


We will apply the estimate (9.22) in the following situation. Suppose (0; — 
L)u =0o0nQ = (0,T] x 2, u > 00nQ, and 


1 
(9.23) meas{(t,7) € Q: u(t,x) > 1} > 3 meas Q. 
Let 2 be a ball in R” and O a concentric ball, such that 
3 
(9.24) meas O > Z meas (2. 


Here, dV = b dz is used to compute the measure of a set in R”. Given h > 0, let 
(9.25) Or(h) ={x Ee O:ul(t,2z) >h}, O(h) = {x €D: ult, x) > h}. 
Pick y € C§°(Q) such that y = 1 on O, and set 


1 
uth’ 


(9.26) v = f(u) = logt 


Note that f satisfies the differential inequality (9.19), and f(u) = 0 for u > 1. 
From the hypothesis (9.23), we can pick T; € (0,7) such that 


1 
(9.27) meas Q7, (1) > 5 meas 2. 


We let ¢ be any point in (7), T] and apply (9.22), with T, = t (discarding the first 
integral). Since v > log(1/2h) for x € O \ O;(h) while v < log(1/h) on Q and 
v = 0 onat least half of 2, we get 


1 


(9.28) (log =) meas(O \ O;(h)) < K+ 5 (los A 


) meas 2), 
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with ic independent of h. In view of (9.24), this implies 


1 kK 
meas {) — 


1 
(9.29) meas O;(h) > ri Ia XW 


where ; — 
ey mee 08 
Mh) = logs, 5(h) 


~ log i 
Taking h sufficiently small, we have 
Lemma 9.4. [fu > 0 on Q = [0,7] x 2 satisfies (0, — L)u = 0, then, under 


the hypotheses (9.23) and (9.24), there exist h > 0 and T, < T such that, for all 
te (Th, Th, 


1 
(9.30) meas {x €O:u(t,x) > h} > 5 meas O. 


We are now ready to prove the following Harnack-type inequality: 


Proposition 9.5. Let u > 0 be a solution to (0, — L)u = 0 on Q = [0,T] x Q, 
where Q) is a ball in R” centered at xo. Assume that (9.23) holds. Then there is 
a concentric ball Q, a number T < T, and « > 0, depending only on Q and the 
quantities \;,b; in (9.2), such that 


(9.31) u(t,c)>« on[r,T]) xQ=Q. 


Proof. Pick 7 € (71,7), and let Qo = [7,T] x O. We will apply (9.22), with 
the double integral taken over Qo, and with 


h 
ute 


(9.32) v = f(u) = logt 


Here, h is as in (9.30), and we will take € € (0,h/2]. With » € C§°(O), (9.22) 
yields 


1 h 
(9.33) 5 // y?|Vevl|? dtdV < K+ [ gu(r0,2) dV <K+C,log = 
Qo oO 


Now v = f(u) = 0 for u > h, hence on the set O,(h), whose measure was 
estimated from below in (9.30). Thus, for each t € [7, T], 


(9.34) fea) dV < Cy iene x)|° dV 


oO 6) 
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if we take © to be a ball concentric with O, such that meas O > (9/10) meas O. 
We make y = 1 on O and conclude that 


h oe 
(9.35) ffe dt dV <03+Cylog=, R= [70,7] x 0. 


Since the function _f in (9.32) is convex, we see that Theorem 9.3 applies to v. 
Hence we obtain Q C FR such that 


3h 
9.36 x <C < Clog —. 
(9.36) lll S Cllellz2qe) $ Clog ~ 
Now, if we require that € € (0, h/ 2] and take ¢ sufficiently small, this forces 
(9.37) u>ev?—-e onQ, 


and the proposition is proved. 


We now deduce the Hdlder continuity of a solution to (0, — L)u = 0 on 
Q = [0,7] x Q from Proposition 9.5, by an argument parallel to that of (9.33)— 
(9.39) of “14.” We have from (9.17) a bound |u(t, x)| < A on any compact subset 
Q of (0, T] x Q. Fix (to, zo) € Q, and let 


(9.38) w(r) = sup u(t, x) — inf u(t, x), 
By r 
where 
(9.39) B, = {(t,2):0<tp—t < ar’, |x —29| < ar}. 


Say B, C Q for r < p. Clearly, w(p) < 2K. Adding a constant to u, we can 
assume 


1 
(9.40) sup u(t, x) = — inf u(t, x) = 50) = M. 


Pp 


Then u4 = 1+u/M and u_ = 1—wu/M are annihilated by 0, — L. They are both 
> 0 and at least one of them satisfies the hypotheses of Proposition 9.5 after we 
rescale B,, dilating x by a factor of p | and t by a factor of p~?. If, for example, 
Proposition 9.5 applies to u,, we have u4(t,z) > « in Bop, for some o € (0, 1). 
Hence w(ap) < (1 — K/2)w(p). Iterating this argument, we obtain 


(9.41) w(orp) < (1- 5) w(o), 
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which implies Hélder continuity: 
(9.42) w(r) <Cr®, 
for an appropriate a > 0. We have proved the following: 


Theorem 9.6. [fu is a real-valued solution to (9.1) on I x Q, with I = |0,T), 
then, given J = [To,T), To € (0,T), O CC Q, we have for some fs > 0 an 
estimate 


(9.43) lellencrx@y S Cllullz2(rxa); 


where C depends on the quantities A;, bj in (9.2), but not on the modulus of con- 
tinuity of a)* (t,x) or of b(t, x). 


Theorem 9.6 has the following implication: 


Theorem 9.7. Let M be a compact, smooth manifold. Suppose u is a bounded, 
real-valued function satisfying 


(9.44) a = div(A(t,x) grad u) 


on |to,to + a] x M. Assume that A(t, x) € End(T,M) satisfies 


(9.45) Nolé? < (AE, ZE,E) < Arlél?, 


where the inner product and square norm are given by the metric tensor on M. 
Then u(to + a,x) = w(x) belongs to C™(M) for some r > 0, and there is an 
estimate 


(9.46) |wllor < K(M, a, Xo, A1)||u(to, +) IL ze. 


In particular, the factor K (M, a, Ao, A1) does not depend on the modulus of con- 
tinuity of A. 


We are now ready to establish some global existence results. For simplicity, we 
take M = T”. 


Proposition 9.8. Consider the equation 
(9.47) ae S > aj; AM*(t,2,u) Opu, u(0) = f. 
Ot J a) ’ 


Assume this is a scalar parabolic equation, so a)* = AJ*(t, x, u) satisfies (9.2), 
with X; = A;(u). Then the solution guaranteed by Proposition 8.4 exists for all 
t>0. 
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Proof. An L°-bound on u(t) follows from the maximum principle, and then 
(9.46) gives a C’-bound on u(t), for some r > 0. Hence global existence follows 
from Proposition 8.4. 


Let us also consider the parabolic analogue of the PDE (10.1) of Chap. 14, 
namely, 


fa) 
(9.48) HT =~ A* (Vu) jeu, u(0) = f, 
with 

(9.49) At*(p) = Foyp, (D)- 


Again assume wu is scalar. Also, for simplicity, we take 14 = T”. We make the 
hypothesis of uniform ellipticity: 


(9.50) AilE? < >> Fojpn (PEER < Adlél?, 


with 0 < A, < Ag < oo. Then Proposition 8.2 applies, given f € H*(M), 
s > n/2 +1. Furthermore, ue = Ocu satisfies 
Oug -_ 


(9.51) a 


S "0; A7*(Vu) Opue, ue(0) = fe = Oef. 

The maximum principle applies to both (9.48) and (9.51). Thus, given u € 
C((0, T], H*) 9 C™((0,T) x M), 

(9.52) ju(t,2)| <[lflln~, [welt 2)| < lIfellu~, OSt<T. 

Now the Nash—Moser theory applies to (9.51), to yield 


(9.53) llue(t, Ilona) < K, O<t<T, 


for some r > 0, as long as the ellipticity hypothesis (9.50) holds. Hence again we 
can apply Proposition 8.4 to obtain global solvability: 


Theorem 9.9. If F'(p) satisfies (9.50), then the scalar equation (9.48) has a solu- 
tion for allt > 0, given f © H*(M), s>n/2+1. 


Parallel to the extension of estimates for solutions of Lu = O to the case 
Lu = f made in Theorem 9.6 of Chap. 14, there is an extension of Theorem 9.6 
of this chapter to the case 


Ou 
(9.54) ap Leth 


where L has the form (9.1). 
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Theorem 9.10. Assume u is a real-valued solution to (9.54) on I x Q, with 


n 
(9.55) sup ||f()Ilzeq) < Ko, p> =. 
tel 2 


Then u continues to satisfy an estimate of the form (9.43), with C also depending 
on Ko. 


It is possible to modify the proof of Theorem 9.6 in Chap. 14 to establish this. 
Other approaches can be found in [LSU] and [Kry]. We omit details. 


With this, we can extend the existence theory for (9.47) to scalar equations of 
the form 


a ‘s 
(9.56) aa = )~d;A** (t,2,u) ut yu), u(0) =f. 


An example is the equation 


(9.57) AG | tu(l—u), u(0)=f, 


the multidimensional case of the equation (7.63) for a model of population growth. 
We have the following result: 


Proposition 9.11. Assume the equation (9.56) satisfies the parabolicity condition 
(9.2), with Xj = Aj; (wu). Suppose we have a, < a2 inR, with p(a1) > 0, paz) < 
0. If f € C(M) takes values in the interval [a,, a2], then (9.56) has a unique 
solution u € C®([0,00) x M). 


Proof. The local solution u € C™({0,T) x M) given by Proposition 8.3 has the 
property that 


(9.58) u(t,z) € [a1,a2], Vtel0,T), re M. 
With this L°°-bound, we deduce a C’-bound on u(t), from Theorem 9.10, and 
hence the continuation of u beyond t = T, for any T’ < oo. 

To see that (9.58) holds, we could apply a maximum-principle-type argument. 
Alternatively, we can extend the Trotter product formula of §5 to treat time- 


dependent operators, replacing L by L(t). Then, for ¢t € [0,T), 


veo u(t) = lim S(t, tra) Fl S(t, FI" f, 
where t; = (j/n)t, S(t, s) is the solution operator to 


a 
(9.60) Or = S- 0; A?* (t,x, u(t,x)) O,v, S(t,s)v(s) = v(t), 
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and F' is the flow on R generated by y, viewed as a vector field on R. In this case, 
Fl” and (t;41,t;) both preserve the class of smooth functions with values in 
[ay 5 ag] < 


We see that Proposition 9.11 applies to the population growth model (9.57) 
whenever a1 € (0, 1] and az € [1, 00). 

We now mention some systems for which global existence can be proved via 
Theorems 9.6-9.10. Keeping M = T”, let u = (u,..., ue) take values in R®, 
and consider 


Ou 0 Ou 
(9.61) - yy oa, Te SE 
where X is a vector field in R‘ and D(w) is a diagonal ¢ x @ matrix, with diagonal 
entries d, € C™(R’*) satisfying 
(9.62) d,(u) >0, VueR*. 
We have the following; compare with Proposition 4.4. 


Proposition 9.12. Assume there is a family of rectangles 


(9.63) K, = {v € RB’: a;(t) < vj < bj(),1< 57 < 4 
such that 
(9.64) Fx(Ks) C Ksat, 8,tER, 


where F¥. is the flow on R° generated by X. If f € C°°(M) takes values in Ko, 
then, under the hypothesis (9.62) on the diagonal matrix D(u), the system (9.61) 
has a solution for allt € R*, and u(t, x) € Ky. 


Proof. Using a product formula of the form (9.59), where S(t, s) is the solution 
operator to 


dv (Diu) 


0.65) = 5e,)? Sth s)vls) = v0), 


and F' = FX, we see that if u is a smooth solution to (9.61) for t € [0, 7), then 
u(t,a) € K; for all (¢, 7) € [0, 7) x M, provided f(x) € Ko for alla € M. This 
gives an L*-bound on u(t). Now, for 1 < k < @, regard each u, as a solution to 
the nonhomogeneous scalar equation 


0.66) Se = 2 az; (te) 55) +Fh, Fe(t,c) = X;(u(t,2)). 
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We can apply Theorem 9.10 to obtain Holder estimates on each u;. Thus the 
solution continues past t = T’, for any T’ < oo. 


Exercises 


1. Show that the scalar equation (9.56) has a solution for all t € [0,00) provided there 
exist C, M € (0,00) such that 


u>M=>y(u)<Cu, u<-M>= v(u) > —-Clul. 


2. Formulate and establish generalizations to appropriate quasi-linear equations of results 
in Exercises 2-6 of § 4, on reaction-diffusion equations. 
3. Reconsider (7.68), namely, 


(9.67) = =(1+uzZ) ‘use, u(0,2) = f(x). 


Demonstrate global solvability, without the hypothesis | f’(x)| <b < ,/1/3. More 
generally, solve (7.65), under only the first of the two hypotheses in (7.66). 
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Nonlinear Hyperbolic Equations 


Introduction 


Here we study nonlinear hyperbolic equations, with emphasis on quasi-linear 
systems arising from continuum mechanics, describing such physical phenom- 
ena as vibrating strings and membranes and the motion of a compressible fluid, 
such as air. 

Sections |—3 establish the local solvability for various types of nonlinear hyper- 
bolic systems, following closely the presentation in [Tay]. At the end of § 1 we 
give some examples of some equations for which smooth solutions break down in 
finite time. In one case, there is a weak solution that persists, with a singularity. 
This is explored more fully later in the chapter. 

In § 4 we prove the Cauchy—Kowalewsky theorem, in the nonlinear case, using 
the method of Garabedian [Gb2] to transform the problem to a quasi-linear, sym- 
metric hyperbolic system. 

In § 5 we derive the equations of ideal compressible fluid flow and discuss some 
classical results of Bernoulli, Kelvin, and Helmholtz regarding the significance of 
the vorticity of a fluid flow. 

In §6 we begin the study of weak solutions to quasi-linear hyperbolic sys- 
tems of conservation law type, possessing singularities called shocks. Section 6 is 
devoted to scalar equations, for which there is a well-developed theory. 

We then study k x & systems of conservation laws, with k > 2, in §§ 7-10, 
restricting attention to the case of one space variable. Section 7 is devoted to the 
“Riemann problem,” in which piecewise-constant initial data are given. Section 8 
discusses the role of “entropy” and of “Riemann invariants” for systems of con- 
servation laws. These concepts are used in §9, where we establish a result of 
R. DiPerna [DiP4] on the global existence of entropy-satisfying weak solutions 
for a class of 2 x 2 systems, in one space variable. 

The first nonlinear hyperbolic system we derived, in § 1 of Chap.2, was the 
system for vibrating strings. We return to this in § 10. Far from setting down a 
definitive analysis, we make note of some further subtleties that arise in the study 
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of vibrating strings, giving rise to problems that have by no means been overcome. 
This starkly illustrates that in the study of nonlinear hyperbolic equations, a great 
deal remains to be done. 


1. Quasi-linear, symmetric hyperbolic systems 


In this section we examine existence, uniqueness, and regularity for solutions to a 
system of equations of the form 


3) 
(1.1) HT = L(t,2,u,D,)utg(t,z,u), u(0) =f. 
We derive a short-time existence theorem under the following assumptions. We 
suppose that 


(1.2) L(t,x,u,D,)v = S > Aj(t, 2, u) Ov 
Jj 


and that each A; is a kK x K matrix, smooth in its arguments, and furthermore 
symmetric: 

(1.3) Aj = Aj. 
We suppose g is smooth in its arguments, with values in R“; u = u(t, 7) takes 
values in R*. We then say (1.1) is a symmetric hyperbolic system. For sim- 
plicity, we suppose x € M = T”, though any compact manifold M can be 
treated with minor modifications, as can the case M = R”. We will suppose 
f¢ H*(M),k>n/2+1. 

Our strategy will be to obtain a solution to (1.1) as a limit of solutions uz to 


(14) Oe =JeLeJette + 9e, te(0) = f, 
where 
(1.5) Lev = S > Aj(t,2, Jeuz) Ojv 
j 
and 
(1.6) Ge = Jeg(t, ©, Jee). 


In (1.4), f might also be replaced by J, f, though this is not crucial. Here, {Jz : 
0 < € < 1} isa Friedrichs mollifier. For M@ = T”, we can define J. by a Fourier 
series representation: 
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(1.7) (Jev)* (2) = plead), LEZ", 
given yp € C§°(R”), real-valued, (0) = 1. For any e > 0, (1.4) can be regarded 
as a system of ODEs for u-, for which we know there is a unique solution, 
for ¢ close to 0. Our task will be to show that the solution u- exists for ¢ in an 


interval independent of ¢ € (0, 1] and has a limit as « \, 0 solving (1.1). 
To do this, we estimate the H*-norm of solutions to (1.4). We begin with 


d 
ay =, || D°ue(t)||22 = 2(D° JeLeJete, D“ue) + 2(D% ge, Due). 


Since J. commutes with D® and is self-adjoint, we can write the first term on the 
right as 


(1.9) 2(L-D° Jeue, D° Jeuz) + 2([D%, Le|Jeue, D° Sete). 


To estimate the first term in (1.9), note that, by the symmetry hypothesis (1.3), 


(1.10) (Le + Lt)v = — 0; A;(t, x, Jeue)]v, 
J 
so we have 
(1.11) 2(L_-D° Jeuz, D° Jeue) < C(||Jeue(2)||o1) || D% Jette ||2.2. 


Next, consider 


(1.12) [D%, Lelv = > (D%(Aje Oj) — AjeD*(8j»)), 


J 


where Aj;- = Aj(t,x,J-ue). By the Moser estimates from Chap. 13, §3 (see 
Proposition 3.7 there), we have 


(1.13) |I[D%,Lelolle2 SO D> (Apel a lOjolle~ + IV Ajell z= l3jell—)) 
J 


provided |a| < k. We use this estimate with v = J-u-. We also use the estimate 
(1.14) |Aj(t, 2, Jete) lle < Cr ([Jetel|z~) (1 + [[Jeuella«), 
which follows from Proposition 3.9 of Chap. 13. This gives us control over the 


terms in (1.9), hence of the first term on the right side of (1.8). Consequently, we 
obtain an estimate of the form 


d 
(115) Ge [[tte() [lire S Ce (|| Jeweller) (1+ |Jeve() Ile) 
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This puts us in a position to prove the following: 


Lemma 1.1. Given f ¢ H*, k > n/2 +1, the solution to (1.4) exists for t in an 
interval I = (—A, B), independent of €, and satisfies an estimate 


(1.16) luc(Ollae < KW), tL, 
independent of € € (0, 1]. 


Proof. Using the Sobolev imbedding theorem, we can dominate the right side of 
(1.15) by E(||ue(#)||Z,x), 80 ||ue(t) ||7p. = y(t) satisfies the differential inequality 


(1.17) — < Ey), y(0) =f \lze- 


Gronwall’s inequality yields a function K (t), finite on some interval [0, B), giving 
an upper bound for all y(t) satisfying (1.17). Using time-reversibility of the class 
of symmetric hyperbolic systems, we also get a bound c(t) for y(t) on an interval 
(—A, 0]. This J = (—A, B) and K(t) work for (1.16). 

We are now prepared to establish the following existence result: 


Theorem 1.2. Provided (1.1) is symmetric hyperbolic and f € H*(M), with 
k > n/2 +1, then there is a solution u, on an interval I about 0, with 


(1.18) u € L©(I,H*(M)) nN Lip(I, H*-1(M)). 
Proof. Take the I above and shrink it slightly. The bounded family 
ue € OL, OCU, A*") 


will have a weak limit point u satisfying (1.18). Furthermore, by Ascoli’s theorem, 
there is a sequence 


(1.19) Ue, —> u in C(I, H*-1(M)), 
since the inclusion H* Cc H*~! is compact. Also, by interpolation inequalities, 
{ue : 0 < € < 1} is bounded in C’(I, H*~?(M)) for each o € (0,1), so since 


the inclusion H*~” <— C1(M) is compact for small o > 0 if k > n/2 +1, we 
can arrange that 


(1.20) Ue, —> u inC(I,C'(M)). 


Consequently, with e = e,, 
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JeL(t, x, Jeuc, D) Jette + Jeg(t, x, Jette) 


1.21 
ed > L(t,2,u, D)u+ g(t,z,u) inCUx M), 


while clearly Ou., /Ot —- Ou/Ot weakly. Thus (1.1) follows in the limit from 
(1.4). 


Let us also note that, with y(t) = ||u-(t)||7,., we have, by (1.15), 


d 
(1.22) cau t alt), alt) =Cx([ldeueller); 
Ne) 
t 
(1.23) y(t) < e (y(0) +f a(s)e~°(s) ds), 
0 


with b(t) = fis a(s) ds. It follows that we have {ue : 0 < € < 1} bounded 
in C(I, H*) © Lip(I,H*~+), as long as k > n/2 + 1, with convergence 
(1.19)-(1.21). A careful study of the estimates shows that J can be taken to be 
independent of & (provided k > $n + 1). In fact, a stronger result will be estab- 
lished in Proposition 1.5. 

There are questions of the uniqueness, stability, and rate of convergence of uz 
to u, which we can treat simultaneously. Thus, with « € [0,1], we compare a 
solution u to (1.1) with a solution u, to 


a 
(1.24) 7a = JeL(t, x, Jee, D) Jee + Jeg(t,2, Jette), te(0) = h. 


Set v = u— Ue, and subtract (1.24) from (1.1). Suppressing the variables (t, x), 
we have 


(1.25) - = L(u, D)v+ L(u, D)ue — JeL(Jeue, D) Jeuz +. g(u) — Jeg(Jeue). 


Write 
L(u, D)ue — JeL( Sete, D) Jette 
+ "on 
+ Je[L(ue,D) — L(Jete, D)| Jee, 
and 
(1.27) g(u) — Jeg(Jete) = [g(u) — g(ue)] + 1 — Je)g(ue) 


+ Jelg(ue) — g(Jeue)]- 


Now write 
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g(u) — g(w) = Gu, w)(u— w), 


(1.28) 1 
G(u, w) = yh g (ru +(1- T)w) dr, 
0 


and similarly 
(1.29) L(u, D) — L(w, D) = (u-— w)- M(u, w, D). 


Then (1.25) yields 
Ov 


(1.30) a L(u, D)u + A(u, ue, Vue)u + Re, 
where 
(1.31) A(u, Ue, Vue)u = u- M(u, ue, D)ue + G(u, ue)v 


incorporates the first terms on the right sides of (1.26) and (1.27), and R, is the 
sum of the rest of the terms in (1.26) and (1.27). Note that each term making up 
R- has as a factor I — J-, acting on either u-, g(u-), or L(u-, D)u-. Thus there 
is an estimate 


(1.32) || Re(t)l|Z2 < Ce(Ilue(#)llor) (1 + Ilete(E) ize) re)”, 
where 
(1.33) rp(é) = |Z — Te\| cuHe-1,12) y |Z — Je\| cH, A): 


Now, estimating (d/dt)||v(t)||72 via the obvious analogue of (1.8)-(1.15) 
yields 


(1.34) < lWOllz2 < COllo@llz2 + SO, 
with 
(1.35) C(t) = C(|lue(E)Ilor, llu(Ilor), S(t) = || Re (8)|Z2- 


Consequently, by Gronwall’s inequality, with K(t) = i 


C(r) dr, 
t 
(1.36) llv(t)||Z2 < eK (If —hl|F2 +f S(r)e7*™ dr), 
0 


for t € [0, B). A similar argument with time reversed covers t € (—A, 0], and we 
have the following: 
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Proposition 1.3. For k > n/2+1, solutions to (1.1) satisfying (1.18) are unique. 
They are limits of solutions uz to (1.4), and, fort € I, 


(1.37) Ilu(t) — we) [lz S Ba) — Jellcca*-1,12)- 


Note that if J. is defined by (1.7) and y(€) = 1 for |E| < 1, we have the 
operator norm estimate 


(1.38) IZ — Jellecre-1,n2) < C e*}. 


Returning to properties of solutions of (1.1), we want to establish the following 
small but nice improvement of (1.18): 


Proposition 1.4. Given f ¢ H®, k > n/2 +1, the solution u to (1.1) satisfies 
(1.39) we C,H"). 


For the proof, note that (1.18) implies that u(t) is a continuous function of t 
with values in H*(M), given the weak topology. To establish (1.39), it suffices to 
demonstrate that the norm ||w(t) || 7« is a continuous function of t. We estimate the 
rate of change of ||u(t)||7,, by a device similar to the analysis of (1.8). Unfortu- 
nately, it is not useful to look directly at (d/dt)||D°u(t)||7.. when |a| = k, since 
LD%u may not be in L?. To get around this, we throw in a factor of J-, and for 
la| < k look at 


d 
(1.40) G De Jeult) Ilia = 2(D°JeL(u, D)u, D° Jeu) 
+ 2(D* Jeg(u), DJ) : 


As above, we have suppressed the dependence on (t,x), for notational con- 
venience. The last term on the right is easy to estimate; we write the first 
term as 


(1.41) 2(D°L(u, D)u, D* J2u) = 2(LD%u, D®* Ju) + 2([D®%, Llu, D* J2u). 


Here, for fixed t, L(u, D)D°u € H~!(M), which can be paired with D° J2u € 
C™(M). We still have the Moser-type estimate 


(1.42) I|[D%, LJullz2 < CllA5(u) lar llullos + Cl A5(M)Ilcrllullae, 


parallel to (1.13), which gives control over the last term in (1.41). We can write 
the first term on the right side of (1.41) as 


(1.43) (LADD? Jt, D° Ju) + 2([Je,£)D%, Du). 


442 16. Nonlinear Hyperbolic Equations 


The first term is bounded just as in (1.10)—(1.11). As for the last term, we have 


(1.44) [Je, L]w = S°[A;(u), Je] Ow. 


J 
Now the nature of J_ as a Friedrichs mollifier implies the estimate 
(1.45) II[Aj, Je]Ojwllz2 < Cll Agllos|wllz2; 


see Chap. 13, § 1, Exercises 1-3. 
Consequently, we have a bound 


d 
(1.46) ag Meu Ollire SC (elles) uO liv, 


the right side being independent of ¢ € (0, 1]. This, together with the same anal- 
ysis with time reversed, shows that ||J-u(t)||7,, = N-(t) is Lipschitz continuous 
in t, uniformly in ¢. As J-u(t) — u(t) in H*-norm for each t € J, it follows that 
l|u(t) |Z; = No(t) = lim N-(t) has this same Lipschitz continuity. Proposition 
1.4 is proved. 

Unlike the linear case, nonlinear hyperbolic equations need not have smooth 
solutions for all t. We will give some examples at the end of this section. Here we 
will show, following [Mj], that in a general context, the breakdown of a classical 
solution must involve a blow-up of either sup,, |u(t, x)| or sup, |V.u(t, x)|. 


Proposition 1.5. Suppose u € C([0,T), H*(M)), k >n/2+1(n=dim M), 
and assume wu solves the symmetric hyperbolic system (1.1) fort € (0,T). Assume 
also that 


(1.47) llu(t)Ilcx~) < K < 00, 


fort € (0,7). Then there exists T, > T such that u extends to a solution to (1.1), 
belonging to C({0,T,), H*(M)). 


We remark that, if A;(t,2, uw) and g(t, x,u) are C™ in a region R x M x Q, 
rather than in all of R x M x R*®, we also require 


(1.48) u(t,z) CQ, CCQ, fort € [0,T). 
Proof. This follows easily from the estimate (1.46). As noted above, with 
N.(t) = ||Jeu(t)||3,.. we have N.(t) + No(t) = |lu(é)||Z,. pointwise as 


€ — 0, and (1.46) takes the form dN. /dt < Ci(t)No(t). If we write this in an 
equivalent integral form: 


t+T 
(1.49) N.(t +7) <N(t)+ [ C1(s)No(s) ds, 
t 
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it is clear that we can pass to the limit e — 0, obtaining the differential inequality 


dN 
(1.50) FS Clllu()lex) Nol) 
for the Lipschitz function No(t). Now Gronwall’s inequality implies that No(t) 
cannot blow up as ¢ 7 T unless ||u(t)||c1 does, so we are done. 


One consequence of this is local existence of C'°°-solutions: 


Corollary 1.6. [f (/.1) is a symmetric hyperbolic system and f © C®°(M), then 
there is a solution u € C(I x M), for an interval I about 0. 


Proof. Pick k > n/2-+ 1, and apply Theorem 1.2 (plus Proposition 1.4) to get a 
solution u € C(I, H*(M)). We can also apply these results with f ¢ H‘(M), for 
¢ arbitrarily large, together with uniqueness, to get u € C(J, H’(M)), for some 
interval J about 0, but possibly J is smaller than J. But then we can use Propo- 
sition 1.5 (for both forward and backward time) to obtain u € C(I, H‘(M)). 
This holds, for fixed J, and for arbitrarily large @. From this it easily follows that 
weCe(lx M). 


We make some complementary remarks on results that can be obtained from 
the estimates derived in this section. In particular, the arguments above hold when 
A;(t,x,u) and g(t, x, u) have only H*-regularity in the variables (a, u), as long 
as k > n/2 + 1. This is of interest even in the linear case, so we record the 
following conclusion: 


Proposition 1.7. Given Aj(t,x) and g(t,x) in C(I,H*(M)), k > $n +1, 
Aj = Aj, the initial-value problem 


Ou 


(1.51) a 


>_> Aj(t,x) Oju+ g(t,x), u(0) = f € H*(M), 


has a unique solution in C(I, H*(M)). 


In some approaches to quasi-linear equations, this result is established first and 
used as a tool to solve (1.1), via an iteration of the form 


0 
(1.52) att = S Aston) Ojty41 + g(t, 2,uUv), Uv41(0) = f, 
J 


beginning, say, with uo(t) = f. Then one’s task is to show that {u,} converges, 
at least on some interval J about t = 0, to a solution to (1.1). For details on this 
approach, see [Mj1]; see also Exercises 3-6 below. 

The approach used to prove Theorem 1.2 has connections with numerical 
methods used to find approximate solutions to (1.1). The approximation (1.4) 
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is a special case of what are sometimes called Galerkin methods. Estimates 
established above, particularly Proposition 1.3, thus provide justification for some 
classes of Galerkin methods. 

For nonlinear hyperbolic equations, short-time, smooth solutions might not 
extend to solutions defined and smooth for all t. We mention two simple examples 
of equations whose classical solutions break down in finite time. First consider 


Ou» 


(1.53) BY u(0,2) =1. 
The solution is 
(1.54) (t,2) = — 
* U — ne! 
a 1 _ t? 
for t < 1, which blows up as t 7 1. 
The second example is 
(1.55) Uz +uuz =0, u(0,x) = e°. 
Writing the equation as 
3) 6) 
1. (= —) =V; 
(1.56) at + une u=0 


we see that u(t,2) is constant on straight lines through (2,0), with slope 
u(0,2)~', in the (a, t)-plane, as illustrated in Fig. 1.1. The line through (0, 0) 
has slope 1, and that through (1, 0) has slope e; these lines must intersect, and by 
that time the classical solution must break down. In this second example, u(t, x) 
does not become unbounded, but it is clear that sup,, |u,(t, 7)| does. As we will 
discuss further in § 7, this provides an example of the formation of a shock wave. 


u=l u=l/e 


FIGURE 1.1 Crossing Characteristics 
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A detailed study of breakdown mechanisms is given in [Al]. 


Exercises 


1. Establish the results of this section when M is any compact Riemannian manifold, 
by the following route. Let {X; : 1 < j < N} be a finite collection of smooth 
vector fields that span the tangent space TM for each x. With J = (j1,..., jx), set 
X7 = Xj, ---Xj,;set |J| = k. Also set X° = I, || = 0. Then use the square norm 


lulls = SOX? allie. 
|J|<k 


Also, let Jz = e**. To establish an analogue of (1.15), it will be useful to have 
[X”, Je] bounded in OPS{5'(M), for |J| =k, € € (0,1). 


2. Consider a completely nonlinear system 


O 
(1.57) TF = F(t,x,u,Veu), u(0) =f. 
Suppose u takes values in R®. Form a first-order system for (vo,v1,-..,Un) = 
(u, O1u,...,; Ont): 
ov = Fit, z, v), 
(1.58) dv, 


ap = \ > (Au,F)(t, 2, v) Oevj + (Ox; F)(t, 2,0), l<j<n, 
£=1 


with initial data 


v(0) =, (f,O1f,..-,Onf). 
We say (1.57) is symmetric hyperbolic if each 0,, F' is a symmetric kK x K matrix. 


Apply methods of this section to (1.58), and then show that (1.57) has a unique 
solution u € C(I, H*(M)), given f € H*(M), k > n/24+2. 


Exercises 3—6 sketch how one can use a slight extension of Proposition 1.7 to show 
that the iterative method (1.52) yields u, converging for |¢| small to a solution to the 
quasi-linear PDE (1.1). Assume f € H*(M), k >n/2+1. 

3. Extend Proposition 1.7, by taking f € H*, and g € C(I, H°), for & € [0, k] (while 
keeping A; € C(I, H*) and k > n/2 + 1), and obtaining u € C(, H). 

4. Granted Proposition 1.7, show that {u,} is bounded in C(I, H*(M)), after possibly 
shrinking J. (Hint: Produce an estimate of the form 


leusa(Olire < {Illi + f “go(r) dr} exp( [ “we(s) ds), 


where y,(s) = y(||uv(s)|| zx) and y(s) = w(||ur(s)|| 7x). Then apply Gronwall’s 
inequality.) 
5. Derive an estimate of the form 
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tu) — wo(Olfina < Ale sup els) ~ e269) 
s€[0,t 


for t € I, and deduce that {u,} is Cauchy in C(I, H*~1(M)), after possibly further 
shrinking J. (Hint: With w = uy+1 — uv, look at a linear hyperbolic equation for w 
and apply the extension of Proposition 1.7 to it, with € = k — 1.) 
6. Deduce that {u,} has a limit u ¢ C(I, H*~'(M)) NL© (I, H*(M)), solving (1.1). 
7. Suppose ui and uz are sufficiently smooth solutions to (1.1), with initial data 


u;(0) = f;. Assume (1.1) is symmetric hyperbolic. Produce a linear, symmetric 
hyperbolic equation satisfied by wi — we. If f: = fe on an open set O C M, deduce 
that ui = ue2 on a certain subset of R x M, thus obtaining a finite propagation 


speed result, as a consequence of the finite propagation speed for solutions to linear 
hyperbolic systems, established via (5.26)-(5.34) of Chap. 6. 

8. Obtain a smooth solution to (1.1) on a neighborhood of {0} x M in R x M when 
f € C®°(M) and M is any open subset of R”. (Hint: To get a solution to (1.1) ona 
neighborhood of (0, xo), identify some neighborhood of xo in M with an open set in 
T” and modify (1.1) to a PDE for functions on R x T”. Make use of finite propagation 
speed to solve the problem.) 

9. Let T, be the largest positive number such that (1.55) has a smooth solution for 
0 <t< T;,. Show that, in this example, 


lu(@)\loi/sqay) < K < co, forO<t< Ts. 


(Hint: For s = T, — t 7 0, consider similarities of the graph of x > u(t,x) = y 
with the graph of « = —y® — sy.) 
10. Show that the rays in Fig. 1.1 are given by 
®(x,t) = (a+ ie a): 


and deduce that 7’. in Exercise 9 is given by 


rT, = /é. 
2 


11. Consider a semilinear, hyperbolic system 


(1.59) ou =Lu+g(u), u(0)=f. 


Paralleling the results of Proposition 1.5, show that solutions in the space 
CU, H*(M)), k > n/2, persist as long as one has a bound 


In Exercises 12-14, we consider the semilinear system (1.59), under the following 
hypothesis: 


(1.61) g(0) =0,  |g’(u)| < C. 


For simplicity, take M@ = T”. 
12. Let uz be a solution to an approximating equation, of the form 
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Oue 
Ot 


(1.62) = J-LJeue + Jeg(Jeue), ue(0) = f. 


Show that 


|uellz2 S Clluellz2, |Vuellz2 < Cll Vuellz2- 


{| {| 
dt dt 
Deduce that, for any ¢ > 0, (1.62) has a solution, defined for all t € R, and, for any 
compact J C R, we have 

ue bounded in L®(I,H'(M)) NM Lip(I, L7(M)). 


13. Deduce that, passing to a subsequence uz, , we have a limit point u € Le (R, Ht! (M)) 
A Lip,,.(R, £?(M)), such that 


Ue, >u in C(I, L7(M)) 
in norm, for all compact I C R; hence g(Jz,Ue,) > g(w) in C(R, L?(M)), and wu 
solves (1.59). Examine the issue of uniqueness. 
Remark: This result appears in [JMR]. The proof there uses the iterative method (1.52). 
14. If dim M = 1, combine the results of Exercises 11 and 13 to produce a global smooth 
solution to (1.59), under the hypothesis (1.61), given f € C'°°(M) and g smooth. 


Remark: If dim M is large, the global smoothness of u is open. For some results, see 
[BW]. 


2. Symmetrizable hyperbolic systems 
The results of the previous section extend to the case 


(2.1) Ao(t, x, u) au = S > Aj(t, 2, u) Oj;u+ g(t,z,u), u(0) =f, 


j=l 
where, as in (1.3), all A; are symmetric, and furthermore 
(2.2) Ao(t,z,u) > cI > 0. 
We have the following: 


Proposition 2.1. Given f € H*(M), k > n/2+1, the existence and uniqueness 
results of § 1 continue to hold for (2.1). 


We obtain the solution wu to (2.1) as a limit of solutions wu, to 


0 
(2.3) Ao(t, a, Jete) = = JeLeJette + ge, ue(0) = f, 


ot 
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where L, and g- are as in (1.5)-(1.6). We need to parallel the estimates of § 1, 
particularly (1.8)-(1.15). The key is to replace the L?-inner products by 


(2.4) (w, Aoe(t)w) po, Aoz (t) = Ao(t, x, JeUe), 
which by hypothesis (2.2) will define equivalent L? norms. We have 


& (Due, Age (t) Due) = 2(D° (du. /dt), Ape (t)D° ue) 


+ (D° ue, Ap. (t)D° ue). 


Here and below, the L?-inner product is understood. The first term on the right 
side of (2.5) can be written as 


(2.6) 2(D° ApeO;te, D°ue) + 2([D%, Aoe]Opue, Due); 


in the first of these terms, we replace Ao-(Ou,/Ot) by the right side of (2.3), 
and estimate the resulting expression by the same method as was applied to the 
right side of (1.8) in § 1. The commutator [D®, Ao] is amenable to a Moser-type 
estimate parallel to (1.12); then substitute for Ou, /Ot, AG. times the right side 
of (2.3), and the last term in (2.6) is easily estimated. It remains to treat the last 
term in (2.5). We have 


d 
(2.7) Aj. (t) = Giolh« Jeue(t, x)), 
hence 
(2.8) | Aoe(t) II x22 < C (|| Jeue(4)||b~, || Jews (4)|| z=). 


Of course, ||Ou-/Ot||z2 can be estimated by ||u.(t)||c1, due to (2.3). Conse- 
quently, we obtain an estimate parallel to (1.15), namely 


d 
(2.9) = D7 (D%uc, AoeD* te) < Cr (llete(E)|le2) (1 + [Jette )IlFre)- 


|a|<k 


From here, the rest of the parallel with § | is clear. 
The class of systems (2.1), with all Aj = Aj and Ag > cl > 0, is an extension 
of the class of symmetric hyperbolic systems. We call a system 


>> B;(é,2,u) 04+ gt,2,u), u(0) = f, 


j=l 


(2.10) a 


a symmetrizable hyperbolic system provided there exist Ao(t,x,u), positive- 
definite, such that Ao(t, 2, u)B;(t,z,u) = Aj(t,x,u) are all symmetric. Then 
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applying Ao(t, x, u) to (2.10) yields an equation of the form (2.1) (with different 
g and f), so the existence and uniqueness results of § 1 hold. The factor A(t, x, wu) 
is called a symmetrizer. 

An important example of such a situation is provided by the equations of 
compressible fluid flow: 


0 1 
5 t Vout — grad p= 0, 
(2.11) p 


) 

a + Ver + pdivv=0. 

Here v is the velocity field of a fluid of density p = p(t, z). We consider the model 
in which p is assumed to be a function of p. In this situation one says the flow is 
isentropic. A particular example is 


(2.12) P(p) = Ap’, 


with A > 0, 1 < y < 2; for air, y = 1.4 is a good approximation. One calls 
(2.12) an equation of state. Further discussion of how (2.11) arises will be given 
in §5. 

The system (2.11) is not a symmetric hyperbolic system as it stands. However, 
one can multiply the two equations by b(p) = p/p’(p) and p~', respectively, 
obtaining 


(2.13) Gi ) 2 (°) a ee : ] ()) . 


Now (2.13) is a symmetric hyperbolic system of the form (2.1). Recall that 


(2.14) (div v, f)r2 = —(v, grad f)p2. 


Thus the results of § 1 apply to the equation (2.11) for compressible fluid flow, as 
long as p is bounded away from zero. 

Another popular form of the equations for compressible fluid flow is obtained 
by rewriting (2.11) as a system for (p, v); using (2.12), one has 


p + Vop + (yp) div v = 0, 
(2.15) a 
a Vvyut+a(p) grad p = 0, 


where o(p) = 1/p(p) = (A/p)'/7. This is also symmetrizable. Multiplying these 
two equations by (yp)~! and p(p), respectively, we can rewrite the system as 


(in) () =—(Cme ne.) () 
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See Exercises 3—4 below for another approach to symmetrizing (2.11). 

We now introduce a more general notion of symmetrizer, following Lax [L1], 
which will bring in pseudodifferential operators. We will say that a function 
R(t, u,z,€), smooth on R x R* x T*M \ 0, homogeneous of degree 0 in €, 
is a symmetrizer for (2.10) provided 


(2.16) R(t, u, x, €) is a positive-definite, K x K matrix 
and 
(2.17) R(t, u,v, €) ) > Bj(t,x, u)&; is self-adjoint, 


for each (t,u,x,€). We then say (2.10) is symmetrizable. One reason for the 
importance of this notion is the following: 


Proposition 2.2. Whenever (2.10) is strictly hyperbolic, it is symmetrizable. 


Proof. If we denote the eigenvalues of L(t, u,x,€) = >> B;(t, x, u)&; by 
Ai(t, u, ,€) Ss Ax (é, u, 2, ), 


then the ,, are well-defined C™-functions of (t, u, x, €), homogeneous of degree 
lin €. If P,(t, u, x, €) are the projections onto the \,,-eigenspaces of L, 


1 


2.1 2p Sa 
G8) 271 


iG i L(t,u,2,€)) dc, 


Y 


then P,, is smooth and homogeneous of degree 0 in €. Then 


(2.19) R(t, u, x, €) EE P,(t, u, 2, €) 


gives the desired symmetrizer. 


We will use results on pseudodifferential operators with nonregular symbols, 
developed in Chap. 13, § 9. Note that 


(2.20) eC = ReC' Ss, 


where the symbol class on the right is defined as in (9.46) of Chap. 13. Now, with 
R= R(t,u, x, D), set 


(2.21) Q= S(R+R) + KA, 


where K > 0 is chosen so that Q is a positive-definite operator on L?. 
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We will work with approximate solutions u- to (2.10), given by (1.4), with 


(2.22) Lgr= SB; Gay Jette) Oh 
2 


Given |a| = k, we want to obtain estimates on (D°u-(t), Q-D°ue(t)), where 
Q. arises by the process above, from Rz = R(t, Jeue, xz, €). We begin with 


d 
dt (Dae: C01.) 
(2.23) = 2(D° Aue, QeD° ue) + (De, QD Ue) 
= 2Re(D°0,uc, ReD*ue) + 2K (D*Ojue, A71 Due) 
+2Re(D°ue, RED“ ue). 


For the last term, we have the estimate 
(2.24) (Due, RED“ ue)| < C(||Ue(t) |] c+) lle (t) Fre: 


We can write the first term on the (far) right side of (2.23) as twice the real part 
of 


(2.25) (R-D° JeLeJetic, D°ue) + (ReD% ge, Duc.) 
The last term has an easy estimate. We write the first term in (2.25) as 


(ReLeD* Jette; D* Jou.) + (Re(D™, L.|Jotic, D® eu) 


(2.26) i . 
+([R-D%, J-|LeJeue, Due). 


Note that as long as (2.20) holds, with r > 0, R- also has symbol in Cra, 
and we have, by Proposition 9.9 of Chap. 13, 


Q27) R=RF+m, RP eoPps);, Reeopolrs |. 
Furthermore, by (9.42) of Chap. 13, 

(2.28) DERE (#,£)€ Sis, |ol=1 

if r > 0. In (2.27) and (2.28) we have uniform bounds for € € (0, 1]. Take 6 close 
enough to | that (1+ 7)5 > 1. We then have [R., J.] bounded in £(H~', L?), 


upon applying Proposition 9.10 of Chap. 13 to R®. Hence we have 


(2.29) [R-D%, J-] bounded in £(H*~1, L), 
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with bound given in terms of ||w-(t)||ci+r. Now Moser estimates yield 

(2.30) ||LeJette|| zre-1 S C'(||Uell z=) || uel] ae + C(|[Uellor) ||tell aes. 

Consequently, we deduce 

(231) |([ReD®, Je|LeJeue, D*ue)| < C((lue(t)||or+) [Ite () Ile: 
Moving to the second term in (2.26), note that, for L = )> B,(t, x, u) 0; 


(2.32) [D*, L] = }“[D*, B,(t, 2, u)] dj. 
Jj 


By the Moser estimate, as in (1.13), we have 


2.33) |[D*, Llu] po $C |IByllips lollare + [Bille llollip |. 
j 
Hence the second term in (2.26) is bounded by C'(||we||c1) |uel|Z,x- 
It remains to estimate the first term in (2.26). We claim that 
(2.34) |(ReLev, v)| < C(|lueller) llullz2- 
To see this, parallel to (2.27), we can write 
Q35) Lira TP eoPst,, eoro ss, ™, 
and, parallel to (2.28), 
(2.36) D2 LF (2,6) € Sis, |a)=1. 


Now, provided (1 + r)6 > 1, 


(2.37) R,L, = RF L® mod £(L*) 
and 
(2.38) (RF L#)" =—R#L? mod OPS) 5, 


so we have (2.34). 
Our analysis of (2.23) is complete; we have, for any r > 0, 


d 
(2.39) G (Due, QeD% ue) S C(lue(t)l|or+r) llue(t) Ize, lal =k. 
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From here we can parallel the rest of the argument of § 1, to prove the following: 


Theorem 2.3. [f (2.10) is symmetrizable, in particular if it is strictly hyperbolic, 
the initial-value problem, with u(0) = f € H*(M), has a unique local solution 
u € C(I, H*(M)) whenever k > n/2 +1. 


We have the following slightly weakened analogue of the persistence result, 
Proposition 1.5: 


Proposition 2.4. Suppose u € C((0,T),H*(M)), k > n/2 +1, and assume 
u solves the symmetrizable hyperbolic system (2.10) for t € (0,T). Assume also 
that, for some r > 0, 


(2.40) u(t) ||c1+r(my) < K < ~, 


fort € [0,T). Then there exists T, > T such that u extends to a solution to (2.10), 
belonging to C({0,T,), H*(M)). 


For the proof of this (and also for the proof of the part of Theorem 2.3 asserting 
that whenever f € H k (M), then u is continuous, not just bounded, in t, with 
values in H*(M)), one estimates 


ae a 
— (D*Jeu(t), QD" Jeu(t)) 


in place of (1.40). Then estimates parallel to (2.24)-(2.39) arise, as the reader can 
verify, yielding the bound 


|u(t) lpm 


d 
(2.41) a y (D* Jeu(t), QD* Jeu(t)) < C(|u(t)||c1+r) 
lja|<k 


If we use this in place of (1.46), the proof of Proposition 1.5 can be parallelled to 
establish Proposition 2.4. 

It follows that the result given in Corollary 1.6, on the local existence of C'°- 
solutions, extends to the case of symmetrizable hyperbolic systems (2.10). 

We mention that actually Proposition 2.4 can be sharpened to the level of 
Proposition 1.5. In fact, they can both be improved; the norms C!*"(M) and 
C1(M) appearing in the statements of these results can be weakened to the Zyg- 
mund norm C}(/). A proof, which is somewhat more complicated than the proof 
of the result established here, can be found in Chap. 5 of [Tay]. 


Exercises 
1. Show that, for smooth solutions, (2.11) is equivalent to 


pr + div(pv) = 0, 


(2.42) 
vz + Vou + grad h(p) = 0, 


454 16. Nonlinear Hyperbolic Equations 


assuming p = p(p). Here, h(p) satisfies 


2. Assume v is a solution to (2.42) of the form v = V(t, x), for some real-valued y. 
One says v defines a potential flow. Show that if y and p vanish at infinity appropriately 
and h(0) = 0, then 


1 
(2.43) ye + 5lVeel” + h(p) = 0. 


This is part of Bernoulli’s law for compressible fluid flow. Compare with (5.45). 
3. Set m = pv, the momentum density. Show that, for smooth solutions, (2.42) is equiva- 
lent to 


pit+ divm =0, 


(2.44) wer 
mz + div(p “m®m)-+ grad p(p) = 0. 


(Hint: Make use of the identity div(u ® v) = (div v)u + Voyu.) 
4. Show that a symmetrizer for the system (2.44) is given by 


Lfp'(e)t+eP -v) im 
p —v' i i po 


Reconsider this problem after doing Exercise 4 in § 8, in light of formulas (8.26)—(8.29) 
for one space dimension, and of formula (5.53) in general. 
5. Consider the one space variable case of (2.10): 


(2.45) ut = Bit, z,u)ur + g(t,z,u), u(0) = f. 


Show that if this is strictly hyperbolic, that is, B(t,x,u) is a K x K matrix-valued 
function whose eigenvalues A,(t,x,u) are all real and distinct, then (2.45) is sym- 
metrizable in the easy sense defined after (2.10). (Hint: Eliminate the €s from the proof 
of Proposition 2.2.) 


3. Second-order and higher-order hyperbolic systems 


We begin our discussion of second-order equations with quasi-linear systems, of 
the form 


uw — )— AI*(t, 2, D'u) 0j;0,u— S> BY(t, 7, D'u) d;0;u 
(3.1) jk j 
= C(t, 2, Dtu). 


For now, we assume A?” and B? are scalar, though we allow wu to take values in 
R”. Here D1u stands for (u, uz, V2u), which we also denote W = (u, uo, v1, 
..+;Un), SO 
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a a 
(3.2) w=, wet, 1<j<n. 


= Oo. ? a — 
Ox; 


We obtain a first-order system for W, namely 


Ou _ 
OL = U0; 
G3) BOS A(t, 0, W) jun + J Bit, 2, W) djuo + Olt, 2, W), 
Ou; 
“BE = Ost 


which is a system of the form 


Ow 


(3.4) ~ 


pe ices W) 0;W + g(t, x,W). 
J 


We can apply to each side the matrix 
(3.5) R= {0 1 


(tensored with the L x L identity matrix), where A~! is the inverse of the matrix 
A = (A’*), The matrix R is positive-definite as long as A is, that is, as long as A 
is symmetric and 


(3.6) SAP (4,0, WEE > Cl€P?, C>0. 

Under this hypothesis, (3.3) is symmetrizable. Consequently we have: 
Proposition 3.1. Under the hypothesis (3.6), if we pick initial data f € 
H**1(M), g € H*(M), k > n/2 4-1, then (3.1) has a unique local solu- 
tion 

(3.7) u € C(I, H**1(M)) nC1(1, H*(M)) 

satisfying u(0) = f, uz(0) = g. 

Proof. Define W = (u, ug, u1,.--, Un), as the solution to (3.3), with initial data 
(3.8) u(0)=f, uo(0)=9, u,;(0) = d;f. 


By Proposition 2.1, we know that there is a unique local solution W € C 
(I, H*(M)). It remains to show that wu possesses all the stated properties. That 
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u(0) = f is obvious, and the first line of (3.3) yields uz(0) = uo(0) = g. Also, 
Uz = Uo € C(I, H*), which gives part of (3.7). The key to completing the proof 
is to show that if W satisfies (3.3) with initial data (3.8), then in fact u; = Ou /Ox Fi 
onl x M. 


To this end, set 
Ou 


Up = 
J 
Ox; 


Since we know that 0u/Ot = uo, applying 0/0t to each side yields 


Vi= 


Ov; _ Ou; Ouo 
Ot Ot Ox; 


by the last line of (3.3). Since u;(0) = 0;u(0) by (3.8), it follows that v; = 0, so 
indeed u; = O;u. Then substituting u; for up and O;u for u; in the middle line of 
(3.3) yields the desired equation (3.1) for wu. 

Finally, since u; € C(I, H*), we have Vu € C(I, H*), and consequently 
u€ C(I, H**4), 


As in § 1, we first take MM = T”. Parallel to Exercise 7 in § 1, we can establish 
a finite propagation speed result and then, as in Exercise 8 of § 1, obtain a local 
solution to (3.1) for other /. 

We note that (3.6) is stronger than the natural hypothesis of strict hyperbolicity, 
which is that, for € 4 0, the characteristic polynomial 


(3.9) a eG x,W)E;7 — san, x, W)EsEx 
J ik 
has two distinct real roots, 7 = A,(t,W,x,&). However, in the more general 


strictly hyperbolic case, using Cauchy data to define a Lorentz metric over the 
initial surface {t = 0}, we can effect a local coordinate change so that, at t = 0, 
(A?*) is positive-definite, when the PDE is written in these coordinates, and 
then the local existence in Proposition 3.1 (and the comment following its proof) 
applies. 

Let us reformulate this result, in a more invariant fashion. Consider a PDE of 
the form 


(3.10) Sa eD) 0;0,u+ F(t, x, D'u) = 0. 


jk 


We let u take values in R” but assume a/*(t, 2, W) is real-valued. Assume the 
matrix (aJ) has an inverse, (a;,). 


Proposition 3.2. Assume (aje(t, x, W)) defines a Lorentz metric on O and S Cc 
O is a spacelike hypersurface, on which smooth Cauchy data are given: 


(3.11) u=f, Yul,=g, 
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where Y is a vector field transverse to S. Then the initial-value problem (3.10- 
3.11) has a unique smooth solution on some neighborhood of S in O. 


In Chap. 18 we will apply this result to Einstein’s gravitational equations. 
We now look at a second-order, quasi-linear, L x [ system of the form 


O7u 


(3.12) aa 


—S¢ A(x, Diu) Oj)Opu = F(x, Diu), 
jk 


where, for each j,k € {1,...,n}, A%*(x,W) is a smooth, L x L, matrix-valued 
function satisfying 


(3.13) aglE?I <0 AM (a, W)EsEu < anlE|?T, 

j,k 
for some ao, a1 € (0,00). This includes equations of vibrating membranes and 
elastic solids studied in § | of Chap. 2. In particular, the condition (3.13) reflects 
the condition (1.60) of Chap. 2. Note that the system (3.12) might not be strictly 


hyperbolic. 
Here, using results of Chap. 13, § 10, we will write 


S- AI* (x, Diu) 0;0xu — F(a, Diu) 
in terms of a paradifferential operator: 


(3.14) $0 AM* (t,x, Diu) 0;0,u — F(x, Diu) = —M(u; x, D)ut R(u), 
jk 


where R(u) € C® and (parallel to (8.20) of Chap. 15) if r > 0, 
(3.15) ue OF" —> M(u;n,£) € Ag Sty + Sia 


Thus, given 6 € (0,1), we can use the symbol-smoothing process as in (10.101)— 
(10.104) of Chap. 13 to write 


(3.16) M(u; a, €) = M*(u;2,£) + M?(u;2,€), 
| M#(u;z,€) € AS S2s, M®(u;x,€) € 977°". 

As in (3.13) we have (with perhaps different constants a,;) 

(3.17) aglé|?I < M*(u;2,€) < ay |é|7J, 


for |€| > 1. We can assume M*(u; x, €) > I, for |€| < 1. Thus, given (3.15), 
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(3.18) M*(u; x, €)1/? = G(u; 2,6) € Ag’ Sts. 
Now let us set 


(3.19) H(u;x,D) = =[G(u; 2, D) + G(u; 2, D)*] + 1 € OPA St 5, 


Nl rR 


which is self-adjoint and positive-definite and satisfies 


(3.20) H(u;2,D)? — M(u;2,D) = B(u;x,D) € OPS}5+OPS;,°*"™. 


Set E(u; z,D) = H(u;xz,D)“1 € OPS; 5: and set 
(3.21) v=H(u;z,D)u, w=wu. 
We have the system 


Ut = W, 
(3.22) uy, = H(u;xz,D)wt+ Ci(u; 2, D)v, 
w, = —H(u; x, D)u + Co(u; 2, D)u + R(u), 


where 


Cy (u; 2, D) = 0;H(u; x, D)- E(u; x, D) € OPS; x; 


(3.23) : : 
Co(u;2,D) = Bu; 2, D) E(u; 2, D) € OPS, 5 + OPS 7: 


provided 6 is sufficiently close to 1 that 1 — (1+r)6 =-o <0. 
Somewhat parallel to (1.4), we obtain solutions to (3.22) as limits of solutions 
Ue to 


Ope = JeWe, 
(3.24) One = JH (Sette; x, D)J_-We a JeCy (Jee; x, Dy Jette, 
Owe = —J-H (Jee; 2, D) Jeve + JeCo(Jetie; 1, D)Jeve + R( Sete). 


Indeed, setting U- = (ue, ve, We), one obtains an estimate 
d : 
G25) S||MUL()|[F2 S C([IWellors+) [||MUe() [72 +H], 


from which local existence follows, by arguments similar to those used in § 1. We 
record the result. 


Proposition 3.3. Under the hypothesis (3.13), if we pick initial data f € 
H**'(M), g € H°(M), s > n/2 +1, then (3.12) has a unique local solution 
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(3.26) u € C(I, H*t'(M)) NC" (I, H*(M)), 


satisfying u(0) = f, uz(0) = g. 


Having considered some quasi-linear equations, we now look at a completely 
nonlinear, second-order equation: 


(3.27) tee = F(t, 2, D'u, O54, O2u), u(0) =f, u,(0) =g. 
Here F = Fi(t,x,&,n,¢) is smooth in its arguments; ¢ = (Cj4) = (0j;Onu), 


and so on. We assume uw is real-valued. As before, set v = (v0, U1,---,Un) = 
(u, O1u,...,Onu). We obtain for v a quasi-linear system of the form 


0?u9 = F(t, 2, D'v), 


Ou; = Y 7 (0, F(t, 2, D'v) 0x05 
(3.28) re 


+ S°(8n,F)(t, 2, Dv) 0;0,0; + Gi(t,x, D'v), 
9 


with initial data 


(3.29) v(0) = (fF, O1f,.--,Onf), ve(0) = (9, O19,---, Ong). 
The system (3.28) is not quite of the form (3.1), but the difference is minor. One 


can reduce this to a first-order system and construct a symmetrizer in the same 
fashion, as long as 


(3.30) PS (06,.F \(t,@, D'v)&be — > (On F(t, 2, Dv) &r 


has two distinct real roots 7 for each € ¥ 0. This is the strict hyperbolicity condi- 
tion. Proposition 3.1 holds also for (3.28), so we have the following: 


Proposition 3.4. If (3.27) is strictly hyperbolic, then given 
fc H**(M), g¢ H*(M), k> srt s 
there is locally a unique solution 
u € C(I, H**1(M)) nC, H*(M)). 
This proposition applies to the equations of prescribed Gaussian curvature, for 


a surface S that is the graph of y = u(x),a2 € Q C R”, under certain circum- 
stances. The Gauss curvature K(x) is related to u(x) via the PDE 
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n+2)/2 _ 


(3.31) det H(u) — K(x)(1+|Vul®)' 0, 


where H(u) is the Hessian matrix, 

(3.32) H(u) = (0;0,u). 
Note that, if F(u) = det H(u), then 

(3.33) DF(u)v = Tr[C(u) A (v)], 
where C(u) is the cofactor matrix of H(u), so 

(3.34) H(u)C(u) = [det H(u)|I. 


Of course, (3.31) is elliptic if A > 0. Suppose K is negative and on the hypersur- 
face © = {x,, = 0} Cauchy data are prescribed, u = f(a’), O,u = g(a’), x! = 
(@1,...,%n—1). Then 0,0;u = 0,0; f on © for 1 < j,k <n—1, 0,0j;u = 0; 
on» for 1 < 7 < n—1, and then (3.31) uniquely specifies 02u, hence H(u), 
on &, provided det H(f) # 0. If the matrix H(w) has signature (n — 1, 1), and if 
& is spacelike for its quadratic form, then (3.31) is a hyperbolic Monge—Ampere 
equation, and Proposition 3.4 applies. 


We next treat quasi-linear equations of degree m, 
m-1 
(3.35) Oru= >> Aj(t,2,D™u, Dz) But C(t,2,D™"u), 


j=0 


with initial conditions 


(3.36) (Ol) = fo, Oal0) = fi,.22507 "a) = faa. 


Here, A;(t,z,w, Dz) is a differential operator, homogeneous of degree m — j. 
Assume u takes values in R*, but for simplicity we suppose the operators A j have 


scalar coefficients. We will produce a first-order system for v = (vo,..-,; Um—1) 
with 
(3.37) vo = A 1, UR = ea du, seeyUm-1 = OF th 


We have 
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Oyvp = Avi, 


(3.38) 
Vm—2 = AUm-1, 


OUm—1 = S- A,(t, z, Pu, DA o; + C(t," z, Pv), 


where Pu = D'™—1u (ie., 080)u = 08.A5+1-™y,), so P € OPS®. Note that the 
operator A;(t,x, Pv, D,)A'tI—" is of order 1. The initial condition for (3.38) is 


(3.39) vo(0) = Fs aan Parr , 0; (0) = Pte malas Arar , Um—1(0) = Tri 1 
The system (3.38) has the form 
(3.40) Ov = L(t, a, Pu, D)v + Git, x, Pv), 


where L is an m X m matrix of pseudodifferential operators, which are scalar 
(though each entry acts on /’-vectors). Note that the eigenvalues of the principal 
symbol of E are iA, (t,x, v,€), where 7 = X, are the roots of the characteristic 
equation 


m—-1 
(3.41) r™ — \° Aj(t,, Pu, é)r? = 0. 
j=0 


We will make the hypothesis of strict hyperbolicity, that for € A 0 this equation 
has m distinct real roots, so L(t, x, Pv, €) has m distinct purely imaginary eigen- 
values. Consequently, as in Proposition 2.2, there exists a symmetrizer, anm xm, 
matrix-valued function R(t,x, w, €), homogeneous of degree 0 in € and smooth 
in its arguments, such that, for € 4 0, 


R(t,xz,w,€) is positive-definite, 


3.42 
on) R(t, x, w,€)L(t,x,w,€) is skew-adjoint. 


Note that, given r € (0,00) \ Zt, 


ve Ci => Lit,2,Pv,f)eC"sS' and 


3.43 
oe) R(t, 2, Pv,£) € Ch" 8°, 


From here, an argument directly parallel to (2.21)—(2.39) establishes the solvabil- 
ity of (3.38)-(3.39). We have the following result: 


Theorem 3.5. [f (3.35) is strictly hyperbolic, and we prescribe initial data f; € 
Hst™—1-3(M), s > n/2 +1, then there is a unique local solution 
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u € C(I, H*t™1(M))nC™ (1, H°(M)), 
which persists as long as, for some r > 0, 
llu(t)llomer +o + OP u(t) louer 


is bounded. 


In [Tay] it is shown that the solution persists as long as 
llu(é)\lom ++ +++ [OP * u(t) Ilo 


is bounded. 

While there is a relative abundance of second-order hyperbolic equations and 
systems arising in various situations, particularly in mathematical physics, com- 
pared to the higher-order case, nevertheless there is value in studying higher-order 
equations, in addition to the fact that such study arises as a “natural” extension 
of the second-order case. We mention as an example the appearance of a third- 
order, quasi-linear hyperbolic equation, arising from the study of relativistic fluid 
motion; this will be discussed in §8 6 and 8 of Chap. 18. 


Exercises 


1. Formulate and prove a finite propagation speed result for solutions to (3.1). 

2. Recall Exercise 2 of § 2, dealing with the equation (2.42) for compressible fluid flow 
when v has the special form v = V2y(t, x). Show that vy satisfies the second-order 
PDE 


(3.44) Ho(Vy) + ¥- 0; Hj (Vy) =0, 


jl 
where Vy = (yt, Vay) and the functions H; are given by 


1 
Ho(Vo) = -K (e+ Z1Ve¢l*) 


Ay(Ve) =(G~)Ho(Ve), gel. 


(3.45) 


Here, K is the inverse function of h, defined by: 
y = h(p) => p= Ky). 


Examine the hyperbolicity of this PDE. 
3. Consider three-dimensional Minkowski space R'’? = {(t, x, y)}, with metric ds? = 
—dt® + dx? + dy”. Let S be a surface in R', given by 


y = u(t, 2). 
Show that the condition for S to be a minimal surface in R'? is that 


(3.46) (1 + u2)use — 2(ut - Ue )Uet — (1 — ur)tee = 0. 
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Show that this is hyperbolic provided u? < 1+ u2, and that this holds provided 
the induced metric tensor on S' has signature (1,1). (Hint: To get (3.46), adapt the 
calculations used to produce the minimal surface equation (7.6) in Chap. 14.) 


Exercises on nonlinear Klein—Gordon equations, and variants 


In these exercises we consider the initial-value problem for semilinear hyperbolic equations 
of the form 


(3.47) un — Au+ mu = f(u), 


for real-valued u. Here, A is the Laplace operator on a compact Riemannian manifold M, 
or on R”. We assume m > 0, and set A = /—A+4+ m?. 


1. Show that, for s > 0, sufficiently smooth solutions to (3.47) satisfy 
d s s s s 
(3.48) Gla’ rullze + IAP uellza] = 2(A°f(u), A°ue) po 


2. Using arguments such as those that arose in proving Proposition 1.5, show that smooth 
solutions to (3.47) persist as long as ||z(t)||z-¢ can be bounded. 
3. Note that the s = 0 case of (3.48) can be written as 


d 
(3.49) a llVullzz + m7 llullze + lluellz2] = 2 f uef(u) dv. 
M 


Thus, if f(w) = g’(u), we have 

(3.50) [| Vu(#)|[22 + m2|lu(t)|22 + lleue(t)|22 — [alu a = const 
Deduce that 

(3.51) g <0 => |lu(t)|Za + |lue(4)||Z2 < const. 

4. Deduce that (3.47) is globally solvable for nice initial data, provided that f(w) = g’(u) 


with g(u) < 0 and that dim M =n =1. 
5. Note that the s = 1 case of (3.48) can be written as 


d 
a [| Lullz2 + ||Vuellz2 + mle 2] 
= 2(Vf(u), Vur) ,2 =F 2m? (f(u), is) ids 


(3.52) 


where L = —A + m?. Assume dim M = n = 3, so that, by Proposition 2.2 of 
Chap. 13, 


H'(M) c L°(M). 
Deduce that the right side of (3.52) is then 


SVE) |lz2 + Vuellz2 +m? f(wllz2 +m? llullz2 


(3.53) 
SCllf (Wllis|Lullz2 + m7 || f(w)llz2 + [Vuellz2 +m? lull. 
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(Hint: To estimate || f’(u)Vull72, use |lvw||Z2 < ol zz» wll ope with 2p’ = 6, so 
2p = 3.) 
6. If f(u) = —u*, then || f’(u)[ 23 < 9llullzo < Cllullza and || f(w)llz2 < |lullzo- 
Making use of (3.51) to estimate ||u|| z71, demonstrate global solvability for 
(3.54) un — Aut mu = —u, 


with nice initial data, given that dim M =n = 3. 


For further material on nonlinear Klein—Gordon equations, including treatments of 
(3.54) with uw replaced by wu”, see [Gril, Ra, Re, Seg, St, Str]. 


In Exercises 7-12, we consider the equation (3.47) under the hypotheses 
(3.55) fO)=0, |fOW |< Ce, £21. 


An example is f(u) = sin u; then (3.47) is called the sine-Gordon equation. 
7. Show that if w is a sufficiently smooth solution to (3.47), and we take the “energy” 
E(t) = ||Au()[I72 + llue@llz2. then 


dE 
—_< E(t 
rr <C+CE(t), 


and hence 
(3.56) Ilw(t)Ila2 < C(t). 
This partially extends Exercise 3, in that f(u) = g/(u), with g(u) < Cilul?. 


8. Deduce that (3.47) is globally solvable for nice initial data (given (3.55)), provided 
that n = 1. 


In Exercises 9-11, assume that n > 2 and that u(0) = uo € H°(R”), we(0) = ui € 
H*—1(R"), s>n/2+1. 
9. Establish an estimate of the form 


(3.57) Ilu(t)Ilu2 < C(e), 


and deduce that (3.47) is globally solvable (given (3.55)), provided n = 2 or 3. (Hint: 
Write u(t) = v(t) + w(t), where v(t) solves 


vit — (A — m?)v =0, v(0)=uo, v(0) =u, 
and 
t ] — 
(3.58) w(t) = i. ans f(u(s)) ds. 
0 
To get (3.57) from this, establish the estimate 


(3.59) IIF(u(t) Iles < Ct), 


from (3.56).) 
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10. Suppose n = 4. Show that 
(3.60) Ilu(t)Ils < C(é), 


and deduce that (3.47) is globally solvable (given (3.55)), provided n = 4. 
(Hint: Start with ||0; 0% (w)||z2 < C1||0j;0.u||p2 + C2||Vull?.s, and use the Sobolev 


estimate 
(3.61) liellz2n@ny < CllVelliz@ny, P=— 5, 
to deduce that 
(3.62) n=4 => |) f(u)llz2 < Cllullze + Cllullize- 


Then use (3.58) to estimate ||w(£)|| 73.) 

11. Show that (3.60) also holds when n = 5. Deduce that (3.47) is globally solvable 
(given (3.55)), provided n = 5. (Hint: Start with ||0;0; f(u)||L2 < Ci|/O;O,ul|ze + 
C2||Vull7.2p, and apply (3.61), with p = 5/3, to get 


(3.63) n=5 => |[f(u)llq25/3 < C(t). 

Use the Sobolev imbedding result H°?(R”) C L"?/("~2P) (R") to deduce 
(3.64) IIF(ut)) 2-1/2 < CE). 

Use (3.58) to deduce 
3.65) n=5 = lull passe < CO. 


Iterate this argument, to get (3.60).) 
12. Derive results on the global existence of weak solutions to (3.47), under the hypothesis 
(3.55), analogous to those in Exercises 12 and 13 of § 1. 


For further results on the equation (3.47), under hypotheses like (3.55), but more 
general, see [BW] and [Str]. 


Exercises on wave maps 


In these exercises, we consider the initial-value problem for semilinear hyperbolic 
systems of the form 


(3.66) ur — Au = B(x, u, Vu), 


where B(x, u, p) is smooth in (x, w) and a quadratic form in p. Here, A is the Laplace 
operator on a compact Riemannian manifold X, u(t, a) takes values in R‘, and Vu = 
Vi 1x U. 


1. Show that, for s > 0, sufficiently smooth solutions to (3.66) satisfy 


d s s s s 
(3.67) Gq lllVeA u(t)||Z2 + |lOeA°u(t)|[?) = 2(A° us, A®B(a, u, Vu) ,o- 
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2. Using arguments such as those that arose in proving Proposition 1.5, show that smooth 
solutions to (3.66) persist as long as ||u(t)||¢1 + ||O,u(£)|| z-¢ can be bounded. 

3. Suppose that, fort € I, u(t,x) solves (3.66) and u : I x X — N, where N isa 
submanifold of R‘. Suppose also that, for all (¢,7) € I x X, 


(3.68) B(ax,u, Vu) L TuN. 
Show that 
1, yo 1 2 
(3.69) e(t, x) = 5 ltl + glVeul , E(t)= | e(t,x) dV(z), 
xX 
satisfies 
dE 
.70 —~=0. 
(3.70) 7 0 


(Hint: The hypothesis (3.68) implies uz - B(x, u, Vu) = 0. Then use (3.67), with 
s=0.) 


In Exercises 4—6, suppose X is the flat torus T”, or perhaps X = R”. Assume 
(3.68) holds. Define 


(3.71) m;(t, x) = ut: Oju. 
4. Show that 

Oe Om; -_ 
(3.72) a » aa 0. 


(Hint: Start with O,e = uz - wet + Veu- Veut, and use the equation (3.66); then use 
ut: B(x, u, Vu) = 0.) 
5. Show that, foreach 7 = 1,...,n, 


Om; 


(3.73) DE 


(6) 
ss a = d1a(Aiu - ju) — O;(Oiu- Oyu) }. 


(Hint: Use 0ju- B(x, u, Vu) to get im; = Au-O;u+ ur - 0; ut; then compute O;e 
and subtract.) 


The considerations of Exercises 1-5 apply to the “wave map” equation 


(3.74) uit — Au = [(u)(Vu, Vu), 
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where Vu = Vi,cu and P'(u)(Vu, Vu) is as in the harmonic map equation (10.25) 
of Chap. 14. Indeed, (3.74) is the analogue of the harmonic map equation for a map 
wu: M — N when N is Riemannian but / is Lorentzian. 

6. Suppose n = 1. Then _X = S’ (or R*). Show that (3.74) has a global smooth solution, 
for smooth Cauchy data, u(0) = f, uz(0) = g, satisfying f : X > N, g(x) € 
T(x) N. 

(Hint: In this case, (3.72)—(3.73) imply Oze - Oe = 0, which gives a pointwise bound 
for e(t, z).) This argument follows [Sha]. 


The paper [Sha] also has results in higher dimensions, including global weak solu- 
tions and singularity formation. The study of wave maps has become quite an active 
area. For further literature, see [ShS, Tat, Tat2, Tao, Tao2, CKLS, Rod, JLR], and, for an 
overview making contact with other types of dispersive wave equations, [IT]. 


4. Equations in the complex domain 
and the Cauchy—Kowalewsky theorem 


Consider an mth order, nonlinear system of PDE of the form 


o™u 
(4.1) aim 
u(0, x) = go(a),... OF” —"u(0, x) = gm_-1(2). 


=AG2, DP uD? Oni 2.., 0,28" "a, 


The Cauchy—Kowalewsky theorem is the following: 


Theorem 4.1. /f A is real analytic in its arguments and g; are real analytic, for 
x € O CR”, then there is a unique u(t, x) that is real analytic for x € O1 CC 
O, t near 0, and satisfies (4.1). 


We established the linear case of this in Chap.6. Here, in order to prove 
Theorem 4.1, we use a method of Garabedian [Gb1, Gb2], to transmutate (4.1) 
into a symmetric hyperbolic system for a function of (¢,2,y). To begin, by a 
simple argument, it suffices to consider a general first-order, quasi-linear, N x N 
system, of the form 


0 “ 0 
(42) = ps Ay (tu) 5 + f(t,x,u), u(0,2) = g(a). 


We assume that A; and f are real analytic in their arguments, and we use these 
symbols also to denote the holomorphic extensions of these functions. Similarly, 
we assume g is analytic, with holomorphic extension g(z). We want to solve (4.2) 
for u which is real analytic, that is, we want to extend wu to u(t, x, y), so as to be 
holomorphic in z = x + iy, so that 
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Ou _ Ou = 


4. elie adn 
ae Ox; ‘Oy; 


Now, multiplying this by 7B; and adding to (4.2), we have 


ib = Ou 
(4.4) = Ds + iB; in; 2s rit f(t, «,u). 


We arrange for this to be symmetric hyperbolic by taking 


1 * 
(4.5) B;(t, z,u) = 5 (Ai — Aj). 
Thus we have a local smooth solution to (4.4), given smooth initial data 
u(0,2,y) = g(x,y). Now, if g(x, y) is holomorphic for (2, y) € U, we want to 
show that u(t, x, y) is holomorphic for (7, y) € U; C U if t is close to 0. To see 
this, set 


(4.6) 4 =5(4 SH) Ou 


Then 


(4.7) 7 
+ S| (idz, Bj); + Oz, f(t, z,u). 


8;,A;(t,z,u) = 5> Sv! = C(t, z, u)wv, 


and similarly Oz, f(t, z,u) = F(t, z, u)uv. Thus 


Ov, us ; Ov, = Ovy ” 
(4.8) a S [Aj + 7B;] ae yy B; Out Ev, + >> Gis;, 


a1 


with 
E=) C(t, z,u) + F(t,z,u), Gj = 10z B;. 
J 


This is a symmetric hyperbolic, (Nn) x (Nn) system forv = (vhs 1<p< 
N,1<v <n). The hypothesis that g(x, y) is holomorphic for (x, y) € U means 
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v(0,x,y) = 0 for (x,y) € U. Thus, by finite propagation speed, v(t, x,y) = 0 
on a neighborhood of {0} x U. 

Thus we have a solution to (4.2) which is holomorphic in x + zy, under the 
hypotheses of Theorem 4.1. We have not yet established analyticity in t; in fact, 
so far we have not used the analyticity of A; and f in t. We do this now. As above, 
we use Aj, f, and u also to denote the holomorphic extensions to ¢ = ¢ + is. 
Having u for s = 0, and desiring 0u/Ot = —i0u/0s, we produce u(t, s, x, y) as 
the solution to 


du oo Ou, : 
(4.9) a. yy A; Bn, +if, u(t,0,x,y) = solution to (4.4). 


ing 2b.’ to (4.3) and adding to (4.9), we get 
Applying iB% to (4.3) and adding to (4.9), we g 


Ou = Ou = Ou 
41 — = (Ave |— = \ Br ae 
(4.10) ; at 5 +iBT) 5S oF iy tf 


which we arrange to be symmetric hyperbolic by taking 

# ve 
(4.11) Brits, y) = ai + Aj). 
To see that the solution to (4.9) is holomorphic in ¢ + is, let 


(4.12) ie 


du 8 a 
=i(F tig) = 


By the initial condition for u at s = O given in (4.9), we have w = 0 for s = 0. 
Meanwhile, parallel to (4.7), w satisfies a symmetric hyperbolic system, so wu is 
holomorphic in t+-is. This establishes the Cauchy—Kowalewsky theorem for (4.2), 
and the general case (4.1) follows easily. 

There are other proofs of the Cauchy—Kowalewsky theorem. Some work by 
estimating the terms in the power series of u(t, 2) about (0,29). Such proofs are 
often presented near the beginning of PDE books, as they are elementary, though 
many students have grumbled that going through this somewhat elaborate argu- 
ment at such an early stage is rather painful. The proof presented above reflects 
an aesthetic sensibility that prefers the use of complex function theory to power- 
series arguments. Another sort of proof, with a similar aesthetic, is given in [Nir]; 
see also [Ovs] and [Cafl]. 

There is an extension of the Cauchy—Kowalewsky theorem to systems (not 
necessarily determined), known as Cartan—Kahler theory. An account of this and 
many important ramifications can be found in [BCG3]. 
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Exercises 
1. Fill in the details of reducing (4.1) to (4.2). 


5. Compressible fluid motion 

We begin with a brief derivation of the equations of ideal compressible fluid flow 
on a region 22. Suppose a fluid “particle” has position F(t,x) at time t, with 
F(0,x) = x. Thus the velocity field of the fluid is 

(5.1) u(t,y) = Fi(t,z) Ee T,Q, y= F(t,2), 


where F; (t,x) = (0/0t)F(t, x). If y € OO, we assume that v(t, y) is tangent to 
OQ. We want to write down a Lagrangian for the motion. At any time t, the kinetic 


energy of the fluid is 
1 2 
5 | lee wF elt, y) dy 
Q 


5 | |Fitt2))Po0() ae 


2 


K(t) 
(5.2) 


I 


where p(t, y) is the density of the fluid, and po(x) = p(0, x). Thus 
(5.3) po(x) = p(t, y) detD,F (t,x), y= F(t,2). 


In the simplest models, the potential energy density is a function of fluid den- 
sity alone: 


= / W (o(t, y)) pt, y) dy 


(5.4) 
= f W(olt, F(t,2))) po(2) ay, 
Q 


Set W(p) = Q(p~'), co(x) = po(x)~'. In such a case, the Lagrangian action 
integral is 


(5.5) L(F)= [Ils [51 (t,x) |? — Q(0(z) det DF (t,2)) | po(c) dx dt 
7 


defined on the space of maps F’ : I x Q — Q, where J is an arbitrary time interval 
[to, t1] C [0, 00). We seek to produce a PDE describing the critical points of L. 
Split L(F’) into L(F) = Lx (F) — Ly (FP), with obvious notation. Then 
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d 
DLx(F)w = (Fi(t,2), w(t, F(t,2)) ) pola) dx dt 


= {fF +0: Vv, (t,y))oltsy) dy dt, 


upon an integration by parts, since (d/dt)v(t, F(t,z)) = Ov/dt + v-Vyv. We 
have set w(t, y) = w(t, x), y = F(t, x). Next, 


(5.6) 


DLy(F)w = // Q' (a(x) det D, F(t, x)) det D, F(t, x) 
:Tr( DFE, 2)" Dawi(t,2)) de dé: 


(5.7) 


Now D, w(t, x) = D,w(t, F(t,z)) = Dy w(t, F(t, z)) D, F(t, x), so 


Tr(D,F (2) “D,we2)) = Tr Dw, F2)) 


5.8 
as = div w(t, F(t,2)). 
Hence 

Div (F)w= ff Q'(olt,y)) aiv (ty) dy at 
(5.9) 


= [fo 0 vltv), alt.) dy dt. 


Since W(p) = Q(p~'), we have Q”(p"')p~? = p?W"(p) — 2pW'(p) = 
pX"'(p) if we set 


(5.10) X(p) = pW(p), 


so we can write 
G.I) Dby(Fw= ff (x"()Vyp.t(t.y))oltsy) dy at. 


Thus we have the Euler equations: 


(5.12) Os Vv X"()Vp = 0, 
Op... 
(5.13) Bt + div(pv) = 0. 


Equation (5.12) expresses the stationary condition, DL(F')w = 0 for all smooth 
vector fields w, tangent to OQ, while (5.13) simply expresses conservation of mat- 
ter. Replacing v- Vv by Vv as we have done above makes these equations valid 
when (2 is a Riemannian manifold with boundary. The boundary condition, as we 
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have said, is 
(5.14) v(t, x) ||] OQ, «Ean. 


Recognizing 0v/Ot+ Vv = (d/dt)u(t, F(t, z)) as the acceleration of a fluid 
particle, we rewrite (5.12) in a form reflecting Newton’s law fF = ma: 


(5.15) (2 Es Vow) Sg 


The real-valued function p is called the pressure of the fluid. Comparison with 
(5.12) gives p = p(p) and 


(5.16) = “ = X"(p). 


The relation p = p(p) is called an equation of state; the function p(p) depends 
upon physical properties of the fluid. 
Making use of the identity 


(5.17) div(u @ v) = (div v)u+ Vuv, 


we can rewrite the system (5.12)-(5.13), with X”(~)V/p replaced by Vp/p, in the 
form 


v)_e+ div(pv @ v) + Vp =0, 
5.18) (pv) (p ) Pp 
pt + div(pu) = 0, 
which is convenient for consideration of nonsmooth solutions. 
It is natural to assume that W(p) is an increasing function of p. One common 
model takes 


(5.19) W(p)=ap™', a>0, 1<y<2. 
In such a case, we have an equation of state of the form 
(5.20) p(p) = Ap”, A=(y-1)a>0. 


Experiments indicate that for air, under normal conditions, this provides a good 
approximation to the equation of state if we take y = 1.4. Obviously, these formu- 
las lose validity when p becomes so large that air becomes as dense as a liquid, but 
in that situation other physical phenomena come into play, and the entire problem 
has to be reformulated. 

We will rewrite Euler’s equation, letting v denote the 1-form corresponding to 
the vector field v via the Riemannian metric on 2. Then (5.12) is equivalent to 
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Ov : 
(5.21) "a +V,0=—dz'(p), X'(p)= fe dp. 


In turn, we will rewrite this, using the Lie derivative. Recall that, for any vector 
field Z, Vy»Z = LyZ + Vz, by the zero-torsion condition on V. Using this, we 
deduce that 


1 
(Lyt — Vvt, Z) = (6, Vzv) = 5 (dlel’, 2), 
so (5.21) is equivalent to 
Oo 7 1» ; 
(5.22) 55 thet = a(5 lel me (r)). 


A physically important object derived from the velocity field is the vorticity, 
which we define to be 


(5.23) & = di, 


for each ¢ a 2-form on 2. Applying the exterior derivative to (5.22) gives the 
Vorticity equation 


a€é eo - 
(5.24) 3p t £ub = 0. 


It is also useful to consider vorticity in another form. Namely, to ra we associate 
a section € of A"—?T (n = dim Q)), so that the identity 


(5.25) ENa= (E,a)w 


holds, for every (n — 2)-form a, where w is the volume form on 2, which we 
assume to be oriented. We have 
Ly&Na=Ly(EAa)—EA Lya 
= (Lek, a)w Tt (c, Lya)w 2 (div vu) (&, a)w ~~ EA Lya 
= (Lob, a)w Tv (div v) te, a)w, 


so (5.24) implies 


(5.26) - + Ly€ + (div v)E = 0. 


This takes a neater form if we consider vorticity divided by p: 


(5.27) w= 
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Then the left side of (5.26) is equal to p(Ow/dt + Lyw) + (0p/dt + Vip + 
p(div v))w, and if we use (5.13), we see that 


(5.28) —+Lyw =0. 


This vorticity equation takes special forms in two and three dimensions, 
respectively. When dim Q). = n = 2, w is a scalar field, often denoted as 


(5.29) w=p ' rot, 
and (5.28) becomes the 
2-D vorticity equation. 


Ow 
(5.30) OE +v- grad w = 0, 


which is a conservation law. 


If n = 3, w is a vector field, denoted as 
(5.31) w= p? curl v, 


and (5.28) becomes the 


3-D vorticity equation. 


0 
(5.32) a + [v,w] =0, 
or equivalently, 
Ow 
(5.33) OE + V,w— Vw =0. 


The first form (5.23) of the vorticity equation implies 


(5.34) &(0) = (F')*€(t), 

where F* (x) = F(t, a), €(t)(x) = E(t, x). Similarly, (5.28) yields 

(5.35) w(t,y) = A” *DF*(x) w(0,2), y= F(t,2), 

where DF*(x) : T;Q — T,Q is the derivative. In case n = 2, this is simply 
w(t, y) = w(0, xz), the conservation law mentioned after (5.30). 


One implication of (5.34) is the following. Let S be an oriented surface in 2, 
with boundary C; let S(t) be the image of S under F'‘, and C(t) the image of C; 
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then (5.34) yields 


(5.36) / E(t) = / E(0). 
S 


S(t) 


Since ra = dv, this implies the following: 


Kelvin’s circulation theorem. 


(5.37) | = fa. 


C(t) C 


We take a look at some phenomena special to the case dim 2. = n = 3, where 
the vorticity € is a vector field on Q, for each t. Fix tp and consider € = €(to). 
Let S be an oriented surface in Q, transversal to €. A vortex tube T is defined 
to be the union of orbits of € through S, to a second transversal surface S2. For 
simplicity we will assume that none of these orbits ends at a zero of the vorticity 
field, though more general cases can be handled by a limiting argument. 

Since dé = d?v = 0, we can use Stokes’ theorem to write 


(5.38) o= fae= fe 
T aT 
Now OT consists of three pieces: S' and S2 (with opposite orientations) and the 


lateral boundary L the union of the orbits of € from 0S to 02. Clearly, the pull- 
back of € to £ is 0, so (5.38) implies 


(5.39) fé= fé 


Applying Stokes’ theorem again, for ra = dv, we have 


Helmholtz’ theorem. For any two curves C’, C2 enclosing a vortex tube, 


(5.40) [o= fe 


Cc C2 


This common value is called the strength of the vortex tube T. 


Also, note that if J is a vortex tube at to = 0, then, for each t, T(t), the image 
of T under F", is a vortex tube, as a consequence of (5.35), with n = 3, since & 
and w = €/p have the same integral curves. Furthermore, (5.37) implies that the 
strength of T(t) is independent of t. This conclusion is also part of Helmholtz’ 
theorem. 

If we write £,v in terms of exterior derivatives, we obtain from (5.22) the 
equivalent formula 
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Ov 


(5.41) a 


~ 1 
+ (di) |v =—d( 5 |v)? + X"(p)). 
We can use this to obtain various results known collectively as Bernoulli’s law. 
First, taking the inner product of (5.41) with v, we obtain 


1/0 
5.42 (= = f) 2_ _£,X"(p). 
(5.42) ap [0 (p) 

Now, consider the special case when the flow v is irrotational (i.e., dv = 0). 
The vorticity equation (5.24) implies that if this holds for any t, then it holds for 
all t. If Q is simply connected, we can pick xp € ( and define a velocity potential 
p(t, x) by 


(5.43) p(t, x) = | 0, 
xo 
the integral being independent of path. Thus dy = v, and (5.41) implies 
Op 1 
5.44 a( = lyf? 4X! )=0 
(5.44) i a, jul“ + X’(p) 


on 2, for an irrotational flow on a simply connected domain 2. In other words, in 
such a case, 


1 
(5.45) pe t gltl +X’) = FO) 
is a function of t alone. This is Bernoulli’s law for irrotational flow. 


Another special type of flow is steady flow, for which vy, = 0 and p = 0. 
In such a case, the equation (5.42) becomes 


1 
(5.46) Ly (Slo? + X"(o)) =9, 


that is, the function (1/2)|v|? + X’(p) is constant on the integral curves of v, 
called streamlines. For steady flow, the equation (5.13) becomes 


(5.47) div(pv) =0, ie, d(p*d) =0. 


If dim Q = 2 and (. is simply connected, this implies that there is a function w on 
Q, called a stream function, such that 


1 
(5.48) pxev=dy, ie, v=—— «dy. 
p 


In particular, v is orthogonal to Vw, so the stream function ~ is also constant on 
the integral curves of v, namely, the streamlines. One is temped to deduce from 
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(5.46) that, for some function H : R > R, 
1 2 / 
(5.49) slel? + X"(p) = He) 


in this case, and certainly this works out in some cases. 
If a flow is both steady and irrotational, then from (5.44) we get 


1 
(5.50) a(5 lol? if X"(p)) = 


which is stronger than (5.46). 
We next discuss conservation of energy in compressible fluid flow. The total 
energy 


(66.51) E(t)=K(t)+Vi(t)= [{geeor +W (p(t, 2)) o(t,2) dx 
Q 


is constant, for smooth solutions to (5.12)—(5.13). In fact, a calculation gives 


(5.52) Bt) = [act dz = -{ div ®(t, x) dz =0, 
Q Q 

where 

(5.53) e(t, #) = solv? + X(p) 


is the total energy density and 


1 
(5.54) O(t,2) = (Gplel? + X"(p)e)v = (e+ p)v. 
One passes from the first integral in (5.52) to the second via 
(5.55) Ope(t, x) + div ®(t, x) = 0, 


which is a consequence of (5.12)—(5.13), for smooth solutions. 

As we will see in § 8 in the special case n = 1, the equation (5.55) can break 
down in the presence of shocks. “Entropy satisfying” solutions with shocks then 
have the property that /(t) is a nonincreasing function of t. 

Now any equation of physics in which energy is not precisely conserved must 
be incomplete. Dissipated energy always goes somewhere. Energy dissipated by 
shocks acts to heat up the fluid. Say the heat energy density of the fluid is ph. One 
way to extend (5.18) is to couple a PDE of the form 
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(5.56) O,(ph) + div(phv) = —Qe — div ®. 


In such a case, solutions preserve the total energy 


(5.57) / (e + ph) de. 


Q 


For smooth solutions, the left side of (5.56) is equal to 


p(hy + Vuh) 4 (pe div(pv)), 
so in that case we are equivalently adjoining the equation 
(5.58) p(hy + Vuh) = —e, — div ®. 
The right side of (5.58) vanishes for smooth solutions, recall, so we simply have 
hi + Vuh = 0, describing the transport of heat along the fluid trajectories. (We 


are neglecting the diffusion of heat here.) 
If we consider the total energy intensity 


1 
(5.59) es 5lel +p 'X(p) +h, 


so p€ =e+ ph, we obtain 


On(pE) + div(pEv) + div(pv) 
= je + div((e+p)v) + O(ph) + div(phv), 


whose vanishing is equivalent to (5.56). Using this, we have the augmented system 


(ou)e + div(pv @ v) + Vp =0, 
(5.60) pit div(pv) = 0, 
(pE)_ + div(pEv) + div(pv) = 0. 


As in (5.20), this is supplemented by an equation of state, which in this context 
can take a more general form than p = p(p), namely p = p(p, €). Compare with 
(5.62) below. 

We mention another extension of the system (5.18), based on ideas from ther- 
modynamics. Namely, a new variable, denoted as S, for “entropy,” is introduced, 
and one adjoins (pS); + div(pSv) = 0, to (5.18), so the augmented system takes 
the form 
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(pu)e+ div(pu @ v) + Vp = 0, 
(5.61) p, + div(pv) = 0, 
(pS); + div(pSv) = 0. 


For smooth solutions, the left side of the last equation is equal to 
(St + VyS) + S(p. + div(pv)), 


so in that case we are equivalently adjoining the equation S; + V,S = 0. 

Adjoining the last equation in (5.61) apparently does not affect the system 
(5.18) itself, but, as in the case of (5.60), it opens the door for a significant change, 
for it is now meaningful, and in fact physically realistic, to consider more general 
equations of state, 


(5.62) p= p(p, 5). 
In particular, one often generalizes (5.20) to 
(5.63) p= A(S)p’. 


Brief discussions of the thermodynamic concepts underlying (5.61) can be 
found in [CF] and [LL]. In [CF] there is a discussion of how the system (5.60) 
leads to (5.61), while [LL] discusses how (5.61) leads to (5.60). 

It must be mentioned that certain aspects of the behavior of gases, related to 
interpenetration, are not captured in the model of a fluid as described in this sec- 
tion. Another model, involving the “Boltzmann equation,” is used. We say no 
more about this, but mention the books [CIP] and [RL] for treatments and further 
references. 


Exercises 


1. Write down the equations of radially symmetric compressible fluid flow, as a system in 
one “space” variable. 


6. Weak solutions to scalar conservation laws; 
the viscosity method 


For real-valued u = u(t, x), we will obtain global weak solutions to PDE of the 
form 


(6.1) A= DLR (wy), uO) =f, 
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fort > 0, « € T” = M, as limits of solutions u, to 


OU, 
(6.2) a =vAu,+5~aFi(u), w(0) =f. 


This method of producing solutions to (6.1) is called the viscosity method. Recall 
from Proposition 1.5 of Chap. 15 that, foreach vy > 0, f € L™~(M), (6.2) has a 
unique global solution 

uy € L®([0,00) x WM) C%((0, 00) x M), 
with 


(6.3) Iu) ||n° < [If llz-, 


for each t > O, and furthermore if uj, solve (6.2) with u;, = f;, then, for each 
t>0, 


(6.4) I|wiv(t) — uar(t)|lzr < Lf — fallzs, 

by Proposition 1.6 in that section. We will use these facts to show that as v \, 0, 
{uv} has a limit point uw solving (6.1), provided f € L~(M)M BV(M), where, 
with M(/) denoting the space of finite Borel measures on M, 

(6.5) BV(M) ={ueED'(M): Vue M(M)}. 

As shown in Chap. 13, § 1, BV(M) c L"/("-1)(M). Of course, that BV C L°° 
for n = 1 is a standard result in introductory measure theory courses. Our analysis 
begins with the following: 

Lemma 6.1. Jf f © BV(M) and wu, solves (6.2), then 

(6.6) {u, :v € (0, 1]} is bounded in L* (Rt, BV). 

Proof. If we define 7, f(x) = f(x + y), it is clear that 

(6.7) fe BV = |lf -— fll < Clyl, 

for |y| < 1/2. Now apply (6.4) with f; = f, fo = T,f to obtain, for each t > 0, 
(6.8) I[ur (t) — Tye (4) [121 < Clyl, 

which yields (6.6). 


Now if we write 0;Fj(u,) = F’(uv)Oj;uv, and note the boundedness in the 
sup norm of F(u,), we deduce that 
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(6.9) {0;F;(uv) : v € (0, 1]} is bounded in L*(R*, M(M)). 
Let us use the inclusion 
(6.10) M(M) Cc H~*?(M), p'd>n, 
a consequence of Sobolev’s imbedding theorem, which implies 
(6.11) BV(M) c H'-*?(M). 


We think of choosing 6 small and p close to 1. We deduce from (6.6) and (6.9) that 
{vAut+ > 0;F;(u,)} is bounded in L*(R*+, H~1~*?(M)), and hence, by (6.2), 


(6.12) {0,u,} is bounded in L°(R*+, H~1~*?). 
Thus, for t, t’ > 0, 
(6.13) || uv (t) — uv (t’) || 7-1-8.2(M) < Clit —t'|, 


with C independent of v € (0, 1]. We now use the following interpolation inequal- 
ity, a special case of results established in Chap. 13: 


llullzee < Cllull 25° llvll 1-00, 
where o € (0,1) ande = (1 —o)(1— 6) + o0(—-1 — 6) = 1-20 —5+4+ 00 is 


> 0 if o is chosen small and positive. We apply this to (6.13) and the following 
consequence of (6.6) and (6.11): 


(6.14) Iluy(t) — w(t’) || Ha-5 < C, 
to conclude that, for some o > 0, € > 0, 
(6.15) {u,} is bounded in C° ((0,T], H*?(M)), 
for all T’ < co; hence, by Ascoli’s theorem, 
(6.16) {uy} is compact in C((0, 7], L?(M)), 
for all T’ < 00. 
From here, producing a limit point u solving (6.1) is easy. Given T’ < ov, by 


(6.16) we can pass to a subsequence 1, —> 0 such that 


(6.17) Ur, =U, >u in C([0,T], L?(M)); 
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by a diagonal argument we can arrange this to hold for all T’ < oo. We can also 
assume u;(t, 7) — u(t, x) pointwise a.e. on R* x M. In view of the pointwise 
boundedness (6.3), we deduce 


(6.18) Fi(ux) > Fj(u) in L?((0,T] x M), 
as k — oo, for each J’. Hence we have weak convergence: 


(6.19) OE > Be’ 


VypAupy +0, Oj Fj (ux) + O;F;(u), 
implying that wu solves (6.1). We summarize: 


Proposition 6.2. Given f ¢ L~(M)N BV(M), the solutions u, to (6.2) have a 
weak limit point 


(6.20) u € C((0, 00), L?(M)) NL™(R* x M)N L® (Rt, BV(M)), 
for all p < ©, solving (6.1). 


As we will see below, weak solutions to (6.1) in the class (6.20) need not be 
unique. However, there is uniqueness for those solutions obtained by the viscosity 
method. A device that provides a proof of this, together with an intrinsic char- 
acterization of these viscosity solutions, is furnished by “entropy inequalities,” 
which we now discuss. 

Let 7 : R — R be any C?-convex function (so 7 > 0). Note that, for v = 
v(t, x), On(v) = n'(v) Av and 03n(v) = n'(v) Fu + nf’(v) (jv), so 


An(v) = 1! (v)Av + 9"(v)|Vav)’. 


Thus, if u,, solves (6.2), and if we multiply each side by 7/(u,), we obtain 


0 1 
(6.21) Ryu) = vAn(uy) — vn (uy) |Vur|? a3 > Ojq; (uw), 


where, using 7'(v) O;Fi(v) = nj (v)Fi(v) 0jv and Ojqj(v) = gj (v) Ojv, we 
require of q; that 


(6.22) qj(v) = 7'(v) F5(v). 


Now, for u,, —> wu as above, we have derived weak convergence n(w,,) 
n(u) and, by the same reasoning, q;(u,,) — qj(u), but we have no basis to 
say that |Vu,,|? — |Vul?, and in fact this convergence can fail (otherwise the 
inequality we derive would always be an equality). Taking this into account, we 
abstract from (6.21) the inequality 
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0 


(6.23) aL 


n(uv) — D> Ojqj (ur) < vAn(u,), 


using convexity of 77, and then, passing to the limit u,, — u, obtain 


a 
(6.24) ale) — S > jaj(u) <0, 


in the sense that we have a nonpositive measure on (0,00) x M on the left side. 
In other words, 


(6.25) yp € C$((0,00) x M), y>0, 


implies 


(6.26) kG u(t,2)) 9 — S> ay (ul ae} inde = 0: 


By a limiting argument, we can let 7(u) tend to |u — k|, for any given k € R, 
and use q;(u) = sgn(wu — k)[Fj(u) — F;j(k)], to deduce (using the summation 
convention) 


(6.27) [fire ky, — sgn(u — k)[F;(u) — Fj (k)] ao} dx dt > 0, 

for all vy satisfying (6.25). That (6.27) holds for all k € R is called Kruzhkov’s 
entropy condition. The following is Kruzhkov’s key result: 

Proposition 6.3. [fu and v belong to the space in (6.20) and both satisfy Kruzh- 
kov’s entropy condition, and if u(0,x) = f(x), v(0,x) = g(x), then, for t > 0, 


(6.28) I(t) — v(é)|lz2 < If — glo. 


Proof. Let us write the entropy condition for v in the form (using the summation 
convention) 


(6.29) [fe flys — sgn(v — £)[F;(v) — F;(0] 5c} dy ds > 0, 


for all 2 € R. Let y = g(s,t,x,y) be smooth and compactly supported in 
s >0,t>0, and y > 0. Now substitute v(s,y) for & in (6.27), u(t, x) for ¢ 
in (6.29), integrate both over dz dy ds dt, and sum, to get 


[fff vw u(t, x) — o(s,y)|(~e + Ps) — sen(ult, x) — v(s,y)) 


. [Fi(u) — Fj(v m2 pee <?\\ de dy ds dt > 0. 


Oy; 


(6.30) 
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We now consider the following functions y: 


(6.31) Y(s, t, zr, y) 4 f(t)dn(t ~~ 8) dn (a _ y); 
where f, dp, 6, > O and, as h — 0, dp, and 6), approach the delta functions on R 


and T” = M, respectively. With such a choice, note that Op/Ox; + Op/Oy; = 0 
and y: + Ys = f’(t)dn(t — 5), (x — y). Passing to the limit h — 0 yields 


(6.32) ul lu(t, x) — v(t,2)| f/(#) de dt > 0, 
for all nonnegative f € CG° ((0, oo)) , which in turn implies 


d 
(6.33) = Mult) — vlan: <0, 


yielding (6.28). 


We have given all the arguments necessary to establish the following: 


Corollary 6.4. Given f © L°(M)NBV(M), the weak solutions to (6.1) belong- 
ing to the space (6.20) which are limits of solutions u, to (6.2) are unique. Given 
two such f;, initial data for viscosity solutions u;, we have 


(6.34) [ua (¢) — wa(4) [z+ < [fi — falls, 


fort => 0. Furthermore, a weak solution to (6.1) is a viscosity solution if and only 
if the entropy inequality (6.27) holds for all k € R. 


As a complementary remark, we note that if u, belonging to (6.20), satisfies 
Kruzhkov’s entropy condition, then automatically u is a weak solution to (6.1). 
Indeed, let v be the viscosity solution with the same initial data as wu; by (6.28), 
v=U. 

Note that (6.27) can be rewritten as 


(6.35) ffiie- aif gt J) Gi(u,k) 3 Fe} ae de> 0 


where Gj(u,k) = [Fj(w) — F;(k)]/(u — &) is smooth in its arguments. The 
formula (6.30) can be similarly rewritten; also, (6.32) can be generalized to 


(6.36) / |u(t, x) —v(t,x {2% -»S G;(u,v) me} a dt > 0, 


for a pair u, v satisfying Kruzhkov’s entropy condition. Suppose their initial data 
are bounded in sup norm by M, which therefore bounds u(t) and v(t) for all 
t > 0; pick A so that 
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(6.37) |u|, lv] < M => S°G;(u,v)? < A?. 


Now pick y(t, x) = f(t)W(t, x), with f as above and w satisfying 


(6.38) v,+ A|V2¥| <0, 
so that 
(6.39) be + D0 1G; (u,»)| - Oj] <0. 


Then (6.36) implies 


(6.40) i, i lu(t,c) — v(t, 2)|f"(eW(t, 2) dr dt > 0. 


By a limiting argument, we can let w be the characteristic function of a set in 
(0,00) x T” of the form 


(6.41) {(t, x) : |x —ao| < B— At}. 


Then, refining (6.33), we deduce that 


(6.42) / |u(t, x) — v(t, 2)| dr = D(t)\ ast 7. 


|x—a9|<B-—At 


In particular, if u(0, x) = v(0, x) on {x : |a—a9| < B}, we deduce the following 
result on finite propagation speed: 


Proposition 6.5. [fu and v are viscosity solutions to (6.1), bounded by M, with 
initial data f and g which agree on a set {x : |x — xo| < B}, and if A is large 
enough that (6.37) holds, then u and v coincide on the set (6.41). 


In light of this, we have in a natural fashion unique, global entropy-satisfying 
weak solutions to (6.1), fort > 0, « € R”, provided the initial data belong to 
°° (IR") and have bounded variation. 

We next consider weak solutions to (6.1) with discontinuities of the simplest 
sort; namely, we suppose that u(t, x) is defined for t > 0, x € R, and that there 
is a smooth curve ¥, given by x = x(t), such that u(t, x) is smooth on either side 
of 7, with a simple jump across 7. If (#,t) € y, denote by [uw] = [u](x, t) the size 
of this jump: 


(6.43) fu] = pa u(a(t) + e,t) — u(a(t) — ,t). 


If F : R > R is smooth, we let [F'] denote the jump in F'(w) across 7. Now, if 
such w solves 
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(6.44) u + E(u), =0 


on (Rt x R) \7, then this object on Rt x R will be a measure supported on ; u 
will be a weak solution everywhere provided this measure vanishes. It is a simple 
exercise to evaluate this measure in terms of the jumps [u] and [F'] and the slope 
of +, or equivalently the speed s = dx /dt, as being proportional to s[u] — [F]. In 
other words, such a wu provides a weak solution to (6.44) precisely when 


(6.45) s[u] =[F] ony. 


This condition is called the jump condition, or the Rankine—Hugoniot condition. 

A special case of solutions to (6.44) off + are functions u that are piecewise 
constant. Thus the jumps are constant, so s is constant, so y is a line; we may 
as well call it the line x = st (possibly shifting the origin on the z-axis). See 
Fig. 6.1. If uw = ue on the left side of y and u = wu, on the right side of +, the 
Rankine—Hugoniot condition becomes 


F(u,) = F (ue) 


Ur — Ue 


(6.46) $= 


An initial-value problem with such piecewise-constant initial data is called a Rie- 
mann problem. Let us describe two explicit weak solutions to 


(6.47) up + s(w), = 


of this form, in Fig. 6.2. 


Claim 6.6. Figure 6.2A describes an entropy-satisfying solution of (6.47), while 
Fig. 6.2B describes an entropy-violating solution. 


In each figure we have drawn in integral curves of the vector fields 0/Ot + 
F’(u)(0/02z) in the regions where u is smooth. Note that in Fig. 6.2A these curves 
run into +, while in Fig. 6.2B these curves diverge from +. 


FIGURE 6.1 Shock Wave 
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t 


(a) (b) 


FIGURE 6.2 Entropy Satisfaction and Violation 


These assertions are consequences of the following result of Oleinik: 


Proposition 6.7. Let u(t, x) be a piecewise smooth solution to (6.44) on R* x R 
with jump across 7, Satisfying the jump condition (6.45). Then the entropy condi- 
tion holds if and only if 

(i) in case Uy < Ug: 

The graph of y = F (wu) over [u,, ue] lies below the chord connecting the point 
(u,, F(ur)) to (ue, F(ue)); 

(ii) in case Uy > Ug: 

The graph of y = F'(u) over [ue, ur] lies above the chord. 


These two cases are illustrated in Fig.6.3. A weak solution to (6.44) which 
satisfies the hypotheses of Proposition 6.7 is said to satisfy Oleinik’s “condition 
(E).” 


Proof. As a slight variation on Kruzhkov’s convex functions, it suffices to con- 
sider the weakly convex functions 


n(u) = 0, foru <k, 
u—k, foru>k, 


plugged into the inequality 7, + q, < 0, with 


case (i) case (ii) 


FIGURE 6.3 Oleinik’s Condition (E) 
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q(u) = 0, foru<k, 
F(u)-—F(k), foru>k, 


as k runs over R. In fact only for k between wu,. and ue is 7 + gz nonzero; in such 
a case it is a measure supported on y, which is < 0 if and only if 


s[n(ue) — n(ur)] < F(ue) — Fur). 
The jump condition (6.46) on s then implies 


(6.48) F(k) >t F(ue) + ha 


Up — Ue Up — UE 


Flur), 


for k between u,. and ug, which is equivalent to the content of (i)—(ii). 


Note that if F’ is convex (i.e., F’’ > 0), as in the example (6.47), then the 
content of (i) and (ii) is 


(6.49) F'(ue) > s > F'(uy) (for F” > 0), 


a result that, for F(u) = u?/2, holds in the situation of Fig. 6.2A but not in that 
of Fig. 6.2B. 

For weak solutions to (6.1) with these simple discontinuities, if the entropy 
conditions are satisfied, the discontinuities are called shock waves. Thus the dis- 
continuity depicted in Fig. 6.2A is a shock, but the one in Fig. 6.2B is not. 

The Riemann problem for (6.47) with initial data ue = 0, u, = 1, has 
an entropy-satisfying solution, different from that of Fig.6.2B, which can be 
obtained as a special case of the following construction. Namely, we look for a 
piecewise smooth solution of (6.44), with initial data u(0, 2) = ug for x < 0, u, 
for x > 0, and which is Lipschitz continuous for t > 0, in the form 


(6.50) u(t, z) = v(t "2). 

The PDE (6.44) yields for v the ODE 

(6.51) v'(s)[F’(v(s)) — s] = 0. 

We look for v(s) Lipschitz on R, satisfying alternatively v'(s) = 0 and 
F’(v(s)) = s on subintervals of R, such that v(—oo) = ug and v(+oo) = uy. 
Let us suppose that F'(w) is convex (F” > 0) for u between we and wu, and that 
the shock condition (6.49) is violated (i.e., we suppose ue < u,-). Since F’(u) is 
monotone increasing on ue < u < u,, we can define an inverse map = (F’)~}, 


G: [F’(ue), F’(u,)] > [ue, ur. 


Then setting 
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t 


FIGURE 6.4 Rarefaction Wave 


UL, for s < F’(ue), 
(6.52) u(s) = 4 G(s), for F’(ue) <s < F’(u,), 
Ur, for s > F’(u,) 


completes the construction. For the PDE (6.47), with F’(u) = u, the solution so 
produced is illustrated in Fig. 6.4. There is a fan of lines through (0,0) drawn in 
this figure, with speeds s running from 0 to 1, and u = s on the line with speed 
dz /dt = s. 

Solutions to (6.44) constructed in this fashion are called rarefaction waves. 
If F' is concave between ue and u,, an analogous construction works, provided 
Ug > Up. 

Rarefaction waves always satisfy the entropy conditions, since if u is a weak 
solution to (6.44), 7(u): + q(u)z = 0 on any open set on which wu is Lipschitz. 

In case F'(u) is either convex or concave over all of IR, any Riemann problem 
for (6.44) has an entropy-satisfying solution, which is either a shock wave or a 
rarefaction wave. In these two respective cases we say we is connected to u,. by 
a shock wave or by a rarefaction wave. If F’’(u) changes sign, there are other 
possibilities. We illustrate one here; let u, < ug, and say F'(u) is as depicted in 
Fig. 6.5 (with an inflection point at v1). 


u, ‘y, ~V uy 


FIGURE 6.5 More Complex Nonlinearity 
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s = F'(u)) s=F'(v) s=([F)(u] 


uy v 


FIGURE 6.6 Rarefaction and Shock 


5 = F'(v2) = [FY [u] 


FIGURE 6.7 Rarefaction Bounded by a Shock 


By the analysis above, we see that if vz} < v < v2, we can connect ug to v 
by a rarefaction wave, and we can connect v and u, by a shock, as illustrated in 
Fig. 6.6. 

These can be fitted together provided [F'(v) — F(u,)]/(v — ur) > F’(v). This 
requires v = vg, so the solution is realized by a rarefaction wave bordered by a 
shock, as illustrated in Fig. 6.7. 

We now illustrate the entropy solution to u;+ (1/2)(u?). = 0 with initial data 
equal to the characteristic function of an interval, namely, 


u(0,7)=1, forO<a<1, 
(6.53) d 
O otherwise. 

For 0 < t < 2, this solution is a straightforward amalgamation of the 
rarefaction wave of Fig. 6.4 and the shock wave of Fig. 6.2A. Fort > 2, there is an 
interaction of the rarefaction wave and the shock wave. Let (x(c),t(a)) denote 
a point on the shock front (for t > 2) where u = o. From [u] = 0, [F] = 07/2, 
and s = [F]/[u] = 0/2, we deduce 


Hence x’ /x = t'/2t, so log = (1/2) logt + C, or a = kt*/?. Since 2 = 2 at 
t = 2 on the shock front, this gives k = \/2. Thus the shock front is given by 


(6.54) a = V2t, fort > 2. 
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(x(0), #(0)) 


u=oOonline x =of 


u=1 


FIGURE 6.8 Curved Shock Front 


This is illustrated in Fig. 6.8. Note that the interaction of these waves leads to 
decay: 


2 
(6.55) sup u(t,2) = i? for t > 2. 


Exercises 


Exercises 1-3 examine a difference scheme approximation to (6.1), used by [CwS] and 
[Kot]. Let h = At, ¢ = Aa;, and let A be the n-dimensional lattice 


A={x ER": x=ca,a€Z"}. 


We want to approximate a solution u(t, x) to (6.1) att = hk, x = ea, by u(k, a), 
defined on Z* x A, satisfying the difference scheme 


. cc 1 1,a) - Sofulke + 6(j)) +u(k, a au)} 
(6.56) . _ 
+5. DoF (uls2-+ 669) ~ F (ulb a — 64) } =. 
for k > 0, with initial condition 
(6.57) u(0, a) = f(a). 


Here, 6(j) = (0,...,1,...,0), with the 1 in the jth position. We impose the “stability 
condition” 


E 0 
6.58 0<h<—, A=max su F’(w)|. 
6.58) <= ax sup LF} (w) 
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1. Show that 

(6.59) sup |f(a)| < M => |u(k,a)| < M. 
(Hint: Write F;(u(k, « + 6(j))) — Fy (u(k, a — 5(j))) = Prag [u(k, a + 6(j)) — 
u(k, « — 6(j))]. Then rewrite (6.56) as 


uk + 1a) = + ){u(k,a + 6(3)) + u(k,o— 6(3))} 
(6.60) 7 


Hence 


where Ye Krag =1, and, given (6.58), Krag > 0. Deduce that Ju(k + 1,a)| < 
supg |u(k, 3)|.) 
2. If u(k, a) solves (6.56) with v(0, a) = g(a), show that 


(6.61) S>|ulk, a) — v(k,a)| < S| f(a) 
acA acA 


Compare with (6.4). Deduce that 


(6.62) Se) ws) <2 VIF) f(a+6(9))|. 


j=laca j=H=lacad 


(Hint: With w(k, a) = u(k, a) — v(k, a), deduce from (6.56) that 


n 


w(k+1,a) = = Di{ulka + 6(j)) + w(k, a 6(3))} 


j=l 


— . 
7 Do {rarsayw(’s a+ 5(j)) _ Vh,a—5(jw(k, a- ())}, 
j=l 


(6.63) 


where F(u(k,a)) — F;(v(k,a)) =WVpaw(k, a). Multiply (6.63) by oka = sgn w 
(kK + 1, a) and sum over a, to get 


x jw(k+1,a)| = So Yeaw(k, a), 


where 


1 nh nh 
m 2 {(1 - a Wises) ha0(5) + (1 + Wier) oho }- 
go 


Using 1 + (nh/e)Vaq > 0, deduce that —1 < ra < 1.) 
3. Show that 


Va = 
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Soh uk + 1a) — u(k,a)| < ASS Se" | F(a + 6(9)) — F(a)| 
a jJ=laca 


(6.64) 


(Hint: Set v(k,a@) = u(k + 1,q@), and apply (6.61). Then use (6.56) to analyze 
u(1,a) — f(a).) 


Let us use the notation 


Avu(k, a) ARG H1,a)— 5 S{u(k,a + 6(9)) +u(io40))}) 


(6.65) pam 
1 
Aju(k,a) = <-|v(k, «+ 8()) — o(k,a - 0(9)], 
so (6.56) takes the form 
(6.66) Au + S> A; F;(u) =0. 


The following is a special case of a result in [L4]. 
4. Let 7 and q; be as in (6.22). Assume 


0<m<n"(u)<M<o, 


and strengthen (6.58) to 


h< —( 1+ -1). 


Show that a solution u to (6.66) also satisfies 


(6.67) Ain(u) + 5° Ajqj(u) < 0. 
j 


Compare with (6.24). 
5. Let uo(t, 2) be the entropy solution to uz + (1/2)(u?)« = 0 with initial data 


Us(0,4) =a ', for0<a<a, 


0 otherwise. 


Compare u, to the solution to (6.53), illustrated in Fig. 6.8. 

Note that, given 0 < o < 1, we have u(t, 7) = u1(t, x) for large t, so there is no 
backward uniqueness. 

Show that aso — 0, us — uo, depicted in Fig. 6.9. Show that wo is an entropy solution 


of 
es (ta 0. ae eie: 
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u=@8onx = 61, 


O<x< V2 


FIGURE 6.9 Solution with Initial Data 5 (a) 


7. Systems of conservation laws in one space variable; 
Riemann problems 


Here we consider L x L first-order systems of the form 
(7.1) uz + F(u)2 = 0, 


where x belongs to either R or S! = R/Z. Here, u takes values in R”, or perhaps 
in some region Q C R¥, and F : Q - R® is smooth. Assume Q is simply 
connected. If wu is a smooth solution of (7.1), then 


(7.2) up + A(u)uz =0, A(u) = Dy F(u). 


Thus A(u) is an L x L matrix. We typically make the hypothesis of strict hyper- 
bolicity, that A(w) has L real and distinct eigenvalues: 


(7.3) A(u)r;(u) = A;(u)rj(u), Ar(u) < +++ < Az(u). 


The vectors r;(u) € R¥ are eigenvectors of A(u). 

The equation (7.1) is said to be a system of conservation laws because, if 
u(t, x) either vanishes sufficiently rapidly as x —+ +oo or is defined for x € S', 
then 


(7.4) [ueo dx =C 


is independent of ¢; thus the components of this vector are conserved quantities. 
To see this, using (7.1), we have 
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5, f wea) de = — f Plus dr =0, 


by the fundamental theorem of calculus. As we will see in §8, (7.1) will 
sometimes give rise to other “conservation laws” for wu. 

We give a couple of examples of systems of conservation laws. First consider 
equations of isentropic compressible fluid flow. When x € R” and n = 1, then 
the system (2.11) for compressible fluid flow specializes to 

Ut T Uy = ay 
(7.5) p 
pt + Ups + PUz = 0. 


We assume p = p(p) is a given function of p, the most common relation being 
(7.6) p(p)= Ap’, A>, 1<7<2, 
as in (2.12). We can rewrite (7.5) in conservation form: 


1 
ut (5° + q(p)),, = 9, 
pt + (pv)x = 0, 


Cn 


where q'(p) = p'(p)/p. If p(p) is given by (7.6), we can take 


AL 


= 


q(p) = 
Alternatively, we can set m = pv, the momentum density, and rewrite (7.5) as 


pr_tm, = 9, 


eo m+ (= +p) =0. 


In this case, we have u = (p,m) and 


0 1 0 1 
79) Alu) = ( a4 ) = ey 
-%+p(o) —u* +p'(p) Qu 


which has eigenvalues and eigenvectors: 


(7.10) At =vtVp'(p), r= ( ; ) ‘ 


As a second example, consider this second-order equation, for real-valued V: 
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(7.11) Vie — K(Va) x = 0, 


which is a special case of (3.12). As discussed in § 1 of Chap. 2, this equation 
arises as the stationary condition for an action integral 


(7.12) J(V) = {lev ~F(V,)| dx dt. 

Here, F'(V,,) is the potential energy density. Thus A’(v) has the form 
K(v) = F'(v). 

If we set 

(7.13) v=V,, w=YV,, 

we get the 2 x 2 system 


V_ — We = O~7 


7.14 

( ) Wt — K(v)e = 0. 

In this case, u = (v, w) and 

(7.15) A(u) = a. Ky =F") 
~\-Ky O77? 7 


We assume F”’(v) > 0. Then (7.14) is strictly hyperbolic; A(w) has eigenvalues 
and eigenvectors 


(7.16) M=Ht/Ky, r= & ) 


As in the scalar case examined in § 6, we expect classical solutions to (7.1) to 
break down, and we hope to extend these to weak solutions, with shocks, and so 
forth. Our next goal is to study the Riemann problem for (7.1), 


u(0,2) = ue, fora <0, 
(7.17) 
ur, forx>0, 


given Ug, Ur € R®, and try to obtain a solution in terms of shocks and rarefaction 
waves, extending the material of (6.43)-(6.52). 
We first consider rarefaction waves, solutions to (7.1) of the form 
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(7.18) u(t, 2) = y(t-t2), 

for p(s) which is Lipschitz and piecewise smooth. Now u, = —(a/t?)y’(a/t) 
and uz = (1/t)y(a/t), so (7.2) implies 

(7.19) (A(y(s)) — s) y'(s) = 0. 

Thus, on any open interval where y’(s) 4 0, we need, for some 7 € {1,..., LD}, 
(7.20) Aj(v(s))=8, — ¥(s) = a5(s)rj(¥(s)), 


where r;(u) is the A;-eigenvector of A(u) and a;(s) is real-valued. Differentiat- 
ing the first of these identities and using the second, we have 


(7.21) a;(s) rj (y(s)) VA; (y(s)) lly 

We say that (7.1) is genuinely nonlinear in the jth field if r;(u) - VA;(u) is 
nowhere zero (on the domain of definition, 9 C R”). Granted the condition of 
genuine nonlinearity, one typically rescales the eigenvector 7°;(u), so that 
(7.22) rj(u)- VA;(u) = 1. 

Then (7.20) holds with a;(s) = 1. 

Consequently, if (7.1) is genuinely nonlinear in the jth field and up € RY is 
given, then there is a smooth curve in R®, with one endpoint at ue, called the 
j-rarefaction curve: 

(7.23) gi(ueT), OX T<a;, 
for some o; > 0, so that 
(7.24) v3 (ue; 0) = ue, 
and, for any o € (0, 05); the function u defined by 
Up, for - < A;(ue), 
7. v rT 
(7.25) u(t,z) = 4 9j(ues7), for read E [Aj (we), Az (95 (ue; 7))], 
pj (uo) =u,r, for ; > Ay tle) 


is a j-rarefaction wave. See Fig. 7.1. Note that given (7.22), we have 


d 
(7.26) api (ues 0) = 75 (ue): 
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FIGURE 7.1 J-Rarefaction Wave 


Next we consider weak solutions to (7.1) of the form 


u(t,v) = ue, fora < st, 
(7.27) 
ur, forz> st, 


for t > 0, given s € R, the “shock speed.” As in (6.45), the condition that this 
define a weak solution to (7.1) is the Rankine—Hugoniot condition: 


(7.28) s[u] = [F], 


where [u] and [F'] are the jumps in these quantities across the line x = st; in other 
words, 


(7.29) F(u,) — F(ue) = s(u,y — ue). 

If course, if L > 1, unlike in (6.46), we cannot simply divide by u, — ue; the 
identity (7.29) now implies the nontrivial relation that the vector F'(u,) — F'\(ue) € 
R” be parallel to u, — ue. We will produce curves Qj (ue;7), smooth on rT € 
(—7;, 0], for some 7; > 0, so that 


(7.30) pj (ue; 0) = ue, 


and, for any 7 € (—7;,0], the function u defined by (7.27) is a weak solution to 
(7.1), with 


(7.31) Ur = 95(ue;T), $= 8;(T), 


where 8;(T) is also smooth on (—7;, 0]. For notational convenience, set y(7) = 
pF (ue; T). Thus we want 


(7.32) F(p(r)) — F(ue) = s;(7)(y(r) — ue). 


If this holds, then taking the 7-derivative yields 


(7.33) (A(o(r)) - 8i(r) 0) = 547) (V7) - we). 
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If this holds, setting 7 = 0 gives 


(7.34) (A(ue) — s;(0))¢'(0) =0, 
(7.35) 8; (0) = A; (ue), 


and vy’ (0) is proportional to r;(ue). Reparameterizing in 7, if y’(0) A 0, we can 
assume 


(7.36) (0) = r5 (ue). 
We now show that such a smooth curve (7) exists. Guided by (7.36), we set 
(7.37) p(T) = ue +77; (ue) + 7C(7) 


and show that, for 7 close to 0, there exists ¢(7) € R®# near 0, such that (7.33) 
holds. We will require that ¢(7) € Vj, the linear span of the eigenvectors of A(uz) 
other than r;(we). Then we want to solve for ¢ € Vj, 7 € R: 


(7.38) 77! [F(ue + 775 (ue) +7C) — F(ue)] — (Aj (ue) +n) [rj(ue) + C] = 0. 
Denote the left side by ®,, so 

(7.39) ®,:OxR—R’, 

where © is a neighborhood of 0 in V;. This extends smoothly to 7 = 0, with 


Bo(¢,m) = A(ue) [rj(ue) + C] — (Ag (ue) + 0) [ry(ue) + ¢] 


= = (Alu) — Aj(ue) — 6 mrj(ue). 


Note that ®)(0,0) = 0. Also, 
(7.41) Do(0, 0) .) = (A(ue) — Aj (ue))¢ — rj (ue), 


which is an invertible linear map of V; 6 R + IR”. The inverse function theorem 
implies that, at least for 7 close to 0, ®;(¢(7),7(7)) = 0 for a uniquely defined 
smooth (¢(r),(7)) satisfying ¢(0) = 0, (0) = 0. 

We see that the curve (7) is defined on a two-sided neighborhood of 7 = 0, 
but, taking a cue from § 6, we will restrict this to 7 < 0 to define the j-shock 
curve Yj (ug; T). Comparing (7.36) with (7.26), we see that (ue; 7) and the j- 
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rarefaction curve 25 (ue; T) fit together to form a C'-curve, for 7 € (—7;,0;); we 
denote this curve by yj (ug; 7). 

In fact, assuming genuine nonlinearity, we can arrange that p;(u;T) be aC 4 
curve, after perhaps a further reparameterization of 5 (ue; T). To see this, we 
compute the second 7-derivatives at 7 = 0. This time, for notational simplicity, 


we denote the j-shock curve by ys(7) and the j-rarefaction curve by ¢,(T). 
Recall that, given (7.22), the second equation in (7.20) becomes 


(7.42) eT) =r (Yr (wh 
Differentiation of this plus use of (7.26) yields 
(7.43) y,(0) = Vr; (ue)? 5 (Ue): 


Next, we take the 7-derivative of (7.33). Set A(r) = A(vs(r7)). We get 


pa (AG) ~ 55(7))o(r) + (Ar) - 84()) (7) 


Thus, since s;(0) = A; (ue) and y’,(0) = r; (ue), we have 
(745) (A(0) ~ j(ue)) $4 (0) = (84(0) — A’(0)) rj (we). 


Now A(r)rj(ys(7)) = Aj (¥s(7))rj(Ys(7)). Let us write this identity as 
(A(z) — Aj(7))r;(7) = 0 and differentiate, to obtain 


(7.46) (A) = dj(ue)) 74 (0) = (40) = A’(0))rj(ue). 


Subtracting from (7.45), we get 


(1.47) (Aue) — Ag(we)) (24 (0) = 1(0)) = (285(0) — X4(0))r5 (ue). 


Now the left side of (7.47) belongs to V;, which is complementary to the span of 
7; (ue), So both sides of (7.47) must vanish. This implies 


1 1 1 
(7.48) $;(0) = 5A5(0) = 595(0) - VA; (ue) = 5, 


and, since y{(0) — r/(0) belongs to the null space of A(ue) — A; (ue), 


(7.49) ys (0) = r7,(0) + Gr; (ue), 


for some 3 € R. 
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Note that r/(0) coincides with the quantity in (7.43). We claim that we can 
reparameterize »,(7) so that G = 0 in (7.49), by taking 


(7.50) @5(T) = o(7 +7’), 
for appropriate a. Indeed, we have @,(0) = y,(0), (0) = y4,(0), and 
(7.51) G5 (0) = v5 (0) + 2ay,(0) = ys (0) + 2ar; (ue). 


Thus, taking ~@ = —(/2 in (7.50) accomplishes this goal. Replacing y.(r) by 
(7.50), we arrange that the curve (pj (we; 7) is C? in r. 

Note that if u, = p$(ue; T), for some T € (—7;, 0], the identity (7.48) together 
with (7.35) implies that the shock speed s = s,(7) of the weak solution (7.17) 
satisfies \;(u;) < s < A;(ug), at least if 7 is close enough to 0. In view of the 
ordering of the eigenvalues of A(u), we have the inequalities 


Aj—1 (ug) < s < A; (ue), 
— j-1(te) <8 <Aj(w) 
Aj (ur) <s< Aj+1(ur), 
for 7 sufficiently close to 0. These are called the Lax j-shock conditions. The 
corresponding weak solutions are called shock waves. 
The function yj (ue;7) is in fact C? in (ue; 7). We can define a C?-map 


(7.53) W(ue371,---,TL) = YL (yr-1(- + (yi (ue); T1) +++); Ti21)3 cae 


Since (d/dr)p;(ue; 0) = 7; (ue) and the eigenvectors r;(ue) form a basis of R, 
we can use the inverse function theorem to conclude the following: 


Proposition 7.1. Assume the L x L system (7.1) is strictly hyperbolic and gen- 
uinely nonlinear in each field. Given ug € Q, there is a neighborhood O of ue 
such that if u, € O, then there is a weak solution to (7.1) with initial data 


u(0,2) = ue, forx <0, 


(7.54) 

Ur, forzx > 0, 
consisting of a set of rarefaction waves and/or shock waves satisfying the Lax 
conditions (7.52). 


See Fig. 7.2 for an illustration, with L = 4. 

We consider how Proposition 7.1 applies to some examples. First consider the 
system (7.14), arising from the second-order equation (7.11). Here, with r+ and 
A+ given by (7.16), we have 


1 
(7.55) re -VA = 45 Ky? Kw. 
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FIGURE 7.2 Shocks and Rarefactions 


The strict hyperbolicity assumption is K, 4 0 on 2. Given this, the hypothesis 
of genuine nonlinearity is that /,,, is nowhere vanishing. To achieve (7.22), we 
would rescale r+, changing it to 


, = —2 


7. 
( Se ~ Koy 


; 
| 


Sue. 


In this case, given ug = (vg, we) CNC R?, the rarefaction curves emanating 
from ug are the forward orbits of the vector fields r_ and r_, starting at we. The 
jump condition (7.29) takes the form 


(7.57) We — Wr = 8(vp — ve), 
, K (ve) — K(v,) = s(w, — we), 


in this case. This requires K (ve) — K(v,) = s?(ve — vy), SO 


(7.58) a i: K (vg) — Kur) 


Ue — Ur Ue — Ur 


This defines a pair of curves through we; half of each such curve makes up a shock 
curve. 

One occurrence of (7.11) is to describe longitudinal waves in a string, with 
V(t, x) denoting the position of a point of the string, constrained to move along 
the x-axis. Physically, a real string would greatly resist being compressed to a 
degree that V, = v — 0. Thus a reasonable potential energy function F'(v) has 
the property that F'(v) > +00 asv \, 0; recall K(v) = F’(v). A situation 
yielding a strictly hyperbolic, genuinely nonlinear PDE is depicted in Fig. 7.3, in 
which F' is convex, C is monotone increasing and concave, K,, is positive and 
monotone decreasing, and K,,,, is negative. Here, Q = {(v, w) : v > O}. 

We illustrate the rarefaction and shock curves through a point ue € Q, in 
Fig. 7.4, for such a case. 

A specific example is 


L+v? 1 2 6 
(7.59) F(v) = —, K(v) =1-— = + es 
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Fv) 
K(v) 
v v 
K'(v) 
v v 
K"(v) 


FIGURE 7.3 Typical String Potential 


FIGURE 7.4 Rarefaction and Shock Curves Through we 
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Ug Uy 
uy uy uy uz 


FIGURE 7.5 Solutions to Riemann Problems 


In Fig. 7.5 we illustrate the solution of the Riemann problem (7.17), with u, = uy 
and uw, = Ug, respectively, where wu; and wz are as pictured in Fig. 7.4. 

However, it must be noted that such an example as (7.59) is exceptional. 
A real elastic substance would tend to have a potential energy function F’(v) that 
increases much more rapidly for large (or even moderate) v. A specific example is 


1 1 1 1 
POG Vqeg BOO age 


(7.60) 9 9 


, i 1 1 
R= st qoge K’O=-(a- aye) 


on Q = {(v,w) : 0 < vu < 1}. In this case, the system (7.14) is genuinely 
nonlinear except on the line {v = 1/2}. 

Another situation giving rise to (7.11) is a model of a string, vibrating in R?, 
but (magically) constrained to have only transverse vibrations, so a point whose 
coordinate on the string is x is at the point (xe, V(t, a) € R? at time ¢. In sucha 
case, 2 = R? and F'(v) has the form F(v) = f (v7), so 


K(v) = 2f'(v?)v. 


Thus /(v) is a smooth odd function on R. Hence Ky, is also odd and must vanish 
at v = 0. Thus genuine nonlinearity must fail at v = 0. We will return to this a 
little later; see (7.85)-(7.91). 

We next investigate how Proposition 7.1 applies to the equations of isentropic 
compressible fluid flow, in the form (7.8), which can be cast in the form 


Vt — We = 0,7 


7.61 
( ) w, — K(v,w), = 9, 


a generalization of the form (7.14), if we set 


2 
(7.62) v=p, w=-m, K(v,w)= + p(v). 


(This v is not the v in (7.5).) For smooth solutions, (7.61) takes the form (7.2) 
with 


(7.63) Aye & - i: 
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This has eigenvalues and eigenvectors 


1 1 1 
af 24 = 
(7.64) Ag = —5Kwt 5VKR+4K, ra (),): 


With (vu, w) given by (7.62), we have 


m 
(7.65) At = — + V0'(p), 
p 
which is equivalent to (7.10). In this case, 
66) et eee 1 (0"(o) vo) hd hace oe: 
VP'(p) p 


the last identity holding when p(p) = Ap’, as in (7.6). Thus the system (7.8) is 
genuinely nonlinear in the region 2 = {(p,m) : p > O}. 

A number of important cases of Riemann problems are not covered by 
Proposition 7.1. We will take a look at some of them here, though our treat- 
ment will not be nearly exhaustive. 

First, we consider a condition that is directly opposite to the hypothesis of 
genuine nonlinearity. We say the jth field is linearly degenerate provided 


(7.67) rj(u)- VA;(u) =O on 2. 

In such a case, the integral curve of R; = r; - V through uz, which we denote 
now by 5 (Uwe; T) instead of (7.23), does not produce a set of data u, for which 
there is a rarefaction wave solution to (7.17), of the form (7.18)—(7.20), but we do 
have the following. 

Lemma 7.2. Under the linear degeneracy hypothesis (7.67), if we set 

(7.68) s = X,(ue), 


and let uy = PF (ue; T) for any T (for which the flow is defined), then 


u(t, 2) = ue, for x < st, 
(7.69) (t,c) =u, fe 
Ur, for x> st 


defines for t > 0 a weak solution to the Riemann problem (7.17); that is, the 
Rankine—Hugoniot condition (7.29) is satisfied. Furthermore, 


(7.70) Aj (Ur) = $. 


Proof. Setting p(T) = y5(ue; 7), we have 
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(7.71) g(r) =17;(¢(7)), (0) = ue. 


By the definition of r;, this implies 


(7.72) [A(y(r)) — Aj (y(7)) | ¢'(7) = 0. 


Now the Rankine—Hugoniot condition (7.29) for uy = y(7), with s = A;(ue), 
holds for all 7 if and only if 


d 


(7.73) ae [F(e)) = dj (ue) 0(7)| =0, 


or equivalently, 


(7.74) [A(y(r)) — Aj(ue)] '(7) = 0. 
On the other hand, 

= j(o(7)) = e'(r) VAs (G(r) = 15 (v(7)) - VAs(W(7)) =O. 
(7.75) A (alr)) jay) = 8, Ve 


This implies (7.70) and also shows that the left sides of (7.72) and (7.74) are equal, 
so the lemma is proved. 


When the jth field is linearly degenerate, the weak solution to the Riemann 
problem defined by (7.69) is called a contact discontinuity. The term “contact” 
refers to the identity (7.70), that is, to 


(7.76) Aj (ue) = 8 = Aj(ur), 


which contrasts with the shock condition (7.52). Note that in defining 5 (ue; T), 
we do not restrict 7 to be < 0, as for a 7-shock curve, nor do we restrict T to be 
> 0, as for a j-rarefaction curve. Rather, 7 runs over an interval containing 0 in 
its interior. 

There is a straightforward extension of Proposition 7.1: 


Proposition 7.3. Assume that the L x L system (7.1) is strictly hyperbolic and 
that each field is either genuinely nonlinear or linearly degenerate. Given ue € Q, 
there is a neighborhood O of ue such that if u, € O, then there is a weak solution 
to (7.1) with initial data 


u(0,2) = ue, for «<0, 


(7.77) 
ur, for x>O0, 
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consisting of a set of rarefaction waves, shock waves satisfying the Lax condition 
(7.52), and contact discontinuities. 


An important example is the following system: 


pet (pv) x = 0, 
(7.78) (pv): + (pv) + p(p, S)x = 0, 
(pS)t + (pSv)x =0, 


for a one-dimensional compressible fluid that is not isentropic; here S(t, x) is 
the “entropy,” and the equation of state p = p(p) is generalized to p = p(p,S). 
Compare with (5.61)-(5.62). Using m = pv as before, we can write this system 
as 


Pt + My = 0, 
a 
(7.79) mi + e; +p(p,8)) =0, 
(pS); + (mS), = 0. 
Note that, for smooth solutions, we can replace the last equation by 


(7.80) its ety. 
p 


In this case, we have u = (p,m, S') and 


0 1 O 

m? |, O m 0. 

A(u) = ae 

0 q 2 

(7.81) : 
0 i: -G 

= —v? + 32 2u Sp 
0 0 Vv 


Note that A(w) leaves invariant the two-dimensional space {(a,},0)}, so as in 
(7.10) we have eigenvalues and eigenvectors 


(7.82) At =vt,/S, re =(1,A4,0)*. 


Also, by inspection A(u)' has eigenvector (0,0,1)', with eigenvalue v, which 
must also be an eigenvalue of A(u); we have 


t t 
(7.83) Ay =v=—, ro = (1,0, -24) - (,=,-22) 3 
p P Ps 
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Thus 


m om 1 
(7.84) ro° Vio = -az +7: 7 = 0. 
P P Pp 

Of course r+ -V Ax are still given by (7.66). Thus we have one linearly degenerate 
field and two genuinely nonlinear fields for the 3 x 3 system (7.79). 

In § 10 we will see that the study of a string vibrating in a plane gives rise to a 
4 x 4 system that, in some cases, has two linearly degenerate fields and two gen- 
uinely nonlinear fields, though for such a system there are also more complicated 
possibilities. 

We now return to the 2 x 2 system (7.14), Le., 


V_ — We = O~7 


(7.85) ee ae 


in cases such as those mentioned after (7.60), that is, 
(7.86) K(v) = 2f'(v?)n, 


where 2 = R? and f’ is smooth, with f’(0) > 0. Thus, as computed before, we 
have 


1 
(7.87) Az=t/Ky, rt =(1,-At)*, re-VA = +5 Ky? Kw, 


or, with the + subscript replaced by j7; +1 = (—1)’, j = 1,2, 


-1 
(7.88) RjAj = (-1)) 5Kuwky?. 
The genuine nonlinearity condition fails on the line » = 0. We will assume that 
f' is behaved so that K, > 0onR, Ky, > 0 0n (0,00), Kyy < 0 on (—co, 0), 
and Kyy (0) > 0. Set 


(7.89) OQ. = {(v,w) :+R;A; > 0}, 


so in the case we are considering, the regions Q!, are pictured in Fig. 7.6. 

In Fig. 7.7 we depict the various shock and rarefaction curves emanating from 
ue on the left and those emanating from u, on the right. The rarefaction curves, 
which are integral cuves of R;, terminate upon hitting the vertical axis {v = 0}. 
On the other hand, the shock curves continue to produce solutions to the Riemann 
problem even after they cross this axis, though the Lax shock conditions might 
break down eventually. Note that the rarefaction curves from uy are flow-outs of 
R; in Q4, and flow-outs of —R; in Q7. 

We look at the question of how to solve the Riemann problem when ug cannot 
be connected to curves that avoid the vertical axis v = 0. 
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w w 
Q! Qi Q? Q? 
R, R, 
v ¥ 


FIGURE 7.6 The Regions 0, 


FIGURE 7.7 Rarefaction and Shock Curves Through we and wu, 


In Fig. 7.8 we indicate in one case how to extend the curve y; (uy; 7) for posi- 
tive 7, beyond the point where this curve (which initially, for 7 > 0, is an integral 
curve of R,) intersects the vertical axis. 

To decide precisely which u,. lie on this continued curve, it is easiest to work 
backward from u,, along the shock curve a1, continued across the vertical axis 
into the region {v < 0}. Let u* denote the first point along o, at which the Lax 
shock condition fails. Thus the solution to the Riemann problem with initial data 
u® for x < 0, uy for x > 0, has a one-sided contact discontinuity, in the sense 
that the speed s satisfies 


(7.90) A1(u*) HSs< A1 (ur). 


Then the flow-out from u* under — R, gives rise to ue that are connected to u, by 
a solution such as that indicated in Fig. 7.9. 

Thus the solution consists of a rarefaction wave connecting ug to u*, followed 
by a jump discontinuity that is a one-sided contact and one-sided shock, as stated 
in (7.90). 

In Fig. 7.10, we take the case illustrated by Fig. 7.8 and relabel the old wu, as 
Um, taking a new u,, connected to u,», by Sz U S2, consisting of the shock curve 
out of u,,, continued beyond the vertical axis until the Lax shock condition fails, 
at uw”, and then followed by the flow-out from u® under — Ro. 

The resulting solution to the Riemann problem is depicted in Fig. 7.11. First 
we have the |-rarefaction connecting uz to u*, followed by the jump disconti- 
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FIGURE 7.8 Connecting we to uy 


uy u, 


FIGURE 7.9 One-Sided Contact Discontinuity 


nuity connecting w* to Um, as in Fig. 7.9. Then we have the jump discontinuity 
connecting w,, to u°, satisfying the shock/contact condition 


(7.91) d2(Um) <8 = A2(u®). 


Finally, u? is connected to u, by a 2-rarefaction. 

Figures 7.9 and 7.11 should remind one of Fig. 6.7, depicting the solution to a 
Riemann problem for a scalar conservation law, satisfying Oleinik’s condition (E). 
In fact, it can be verified that the discontinuities produced by the construction 
above satisfy the following admissibility condition. Say a weak solution to (7.85) 
is equal to (vg, we) for x < st and to (v;,w,) for x > st, t > 0. Then the 
admissibility condition is that, for all v between vg and v,, either 
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FIGURE 7.11 Two One-Sided Contacts 


K(v) — K(ve) 2 K(v,) — K(ve) 


(7.92) = (if s < 0), 
U— Ve Ur — Vg 

or 

(7.93) = Ee se a) (if s > 0). 


U— Ug a Ur — Ug 


Compare this to the formulation (6.48) of condition (E). 

In [Liu1] there is a treatment of a class of 2 x 2 systems, containing the case 
just described, in which an extension of condition (E) is derived. See also [Wen]. 
This study is extended to n x n systems in [Liu2]. 

Further interesting phenomena for the Riemann problem arise when there is 
breakdown of strict hyperbolicity. Material on this can be found in [KK2, SS2], 
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and in the collection of articles in [KK3]. We will not go into such results here, 
though some mention will be made in § 10. 

In addition to solving the Riemann problem when wy and u,. are close, one also 
wants solutions, when possible, when we and u, are far apart. There are a number 
of results along these lines, which can be found in [DD, KK1,Liu2,SJ]. We restrict 
our discussion of this to a single example. 

We give an example, from [LS], of a strictly hyperbolic, genuinely nonlinear 
system for which the Riemann problem is solvable for arbitrary ue, u, € QQ, but 
some of the solutions do not fit into the framework of Proposition 7.1. Namely, 
consider the 2 x 2 system (7.5)-(7.6) describing compressible fluid flow, for «= 
(v,p), with Q = {(v,p) : p > O}. As seen in (7.10), if we switch to (m, p)- 
coordinates, with m = pu, then there are eigenvalues Ax = m/p + \/p'(p) 
and eigenvectors Ri = 0/09 + 4+0/Om. Thus integral curves of R satisfy 


p=1, nm=m/p + v/p'(p); hence 


mp—mp _ , VP(p) 
p 


o] 


that is, integral curves of Rs through (ve, pe) are given by 


Pp / p 
(7.94) v-—v=t vP(s) ds =+ / Ay g(%—3)/2 He 
pe 


pe $ 


If y € (1,2), as assumed in (7.6), then these rarefaction curves intersect the axis 
p = 0. Note that if we normalize R+ so that Ria A+ = 1, then 


(7.95) Ra = (Ay2p78) 1? zee + * : 
p Ov’ Op 


Furthermore, specializing (7.58), we see that the shock curves from ug are 
given by 


_ 1/2 
(7.96) v-—vug= 7 (p(p) a) ; for + (pe — p) > 0. 


Note that these shock curves never reach the axis p = 0. See Fig. 7.12 for a picture 
of the shock and rarefaction curves emanating from ug. 

Now, as in Fig. 7.13, pick uo = (vo, 0) € Q and consider the “triangular” 
region 7, with apex at uo, bounded by the integral curves of R_ (forward) and 
of R+ (backward) through uo, and by the axis p = 0. This is a bounded region. 
Given any uz, u, € 7, we will produce a solution to the Riemann problem, whose 
intermediate state also belongs to 7 (or at least to T). 

In fact, as seen in Fig.7.13, if we € TJ, the rarefaction and shock waves 
described before suffice to do this for u,. in all of 7 except for a smaller trian- 
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FIGURE 7.12 Shock and Rarefaction Curves Through we 


p 


: 


u* 


FIGURE 7.13 Vacuum Region 


gular region in the lower right corner of 7, which we call the “vacuum region.” 
This is bounded by part of 07, plus part of the integral curve of R emanating 
from u*, where u* is the point of intersection of the R_-integral curve through we 
with {p = 0}. 

What we do if u,. belongs to this vacuum region is indicated in Figs. 7.14 and 
7.15. Namely, wg is connected to the vacuum by a rarefaction wave, whose speed 
on the left is A_(ue) = ve — \/p’(pe) and whose speed on the right is A_(u*) = 
A_(v*,0) = v* (since p’(0) = 0 when (7.6) holds). Next, if w* = (v%,0) is the 
point on the axis {p = 0} from which issues the R-integral curve through w,., 
then the vacuum is connected to u,. by a rarefaction wave whose speed on the left 
is A+ (u*) = v% > v* (if u* 4 u*) and whose speed on the right is \+(u,-). In the 
special case that u° = u*, the vacuum state disappears, except for a single ray, 
along which the two rarefaction waves fit together. 
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Uo 


FIGURE 7.14 Connecting we to uy Through the Vacuum 


t 


x=Vv1t 


x=v*t 


vacuum 
x=),(u,)t 


r 


FIGURE 7.15 Associated Solution to the Riemann Problem 


This concludes our discussion in this section of examples of the Riemann prob- 
lem. In § 10 there is further discussion for equations of vibrating strings. 

Continuing a theme from 8 6, we next explore the relation between the shock 
condition (7.52) and the possibility that the solution wu is a limit ase \, 0 of 
solutions to 


(7.97) O,u, + O,F (u,) = 60? ug. 
Here, we will look for solutions to (7.97) of the form 
(7.98) tic(t, 2) = v(e*(@— at). 
This satisfies (7.97) if and only if 


d 


(7.99) fe 


[F(v) — ev(r)] = (7), 
or equivalently, if and only if there exist b € R” such that 


(7.100) u(r) = F(v) — cv —b = ©, (v). 
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In other words, v(7) should be an integral curve of the vector field ®.,. The 
requirement that the limit u(t, ) satisfy the Riemann problem (7.17) is equiv- 
alent to 


(7.101) u(—co) = ue, u(+00) = ur. 


Consequently, ug and wu, should be critical points of the vector field ®,,, con- 
nected by a “heteroclinic orbit.” If this happens, we say ue is connected to u,. via 
a “viscous profile.” 

For ug and wu, to be critical points of ®.,, we need 


(7.102) F (ue) — cue = b = F (ur) — cur, 
hence 
(7.103) F(u,) — Fue) = c(up — ue). 


This is precisely the Rankine—Hugoniot condition (7.29), with s = c. Now, con- 
sider the behavior of the vector field ®., near each of these critical points. The 
linearization near uo = ug or uy is given by 


(7.104) V(uo + v) = (A(uo) — s)v. 


Now, if (7.52) holds (e., A; (ur) < s < Aj (ue)), and if u, and we are sufficiently 
close, then A(ue) — s has L — (7 — 1) positive eigenvalues and 7 — 1 negative 
eigenvalues, while A(u,) — s has L — j positive eigenvalues and j negative eigen- 
values. 

The qualitative theory of ODE guarantees the existence of a heteroclinic orbit 
from ug to u, (if they are sufficiently close). We will not give the proof here, but 
confine our discussion to a presentation of Fig. 7.16, illustrating the 2 x 2 case 
in which ug is connected to u, by a 1-shock. The ODE theory involved here has 
been developed quite far, in order also to investigate cases where ue and u, are 
not close but can still be shown to be connected by a viscous profile. The book 
[Smo] gives a detailed discussion of this. 

We mention a variant of the viscosity method described above, which was used 
in [DD]. Namely, we look at a family of solutions to 


(7.105) Oue + O,F (ue) = et O2u, 
of the form 
(7.106) ue(t,£) = u- (ta), 


where u-(T) solves 


(7.107) evg (rT) = [A(ve) — T] ve (7), 
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uy 


FIGURE 7.16 Forcing a Heteroclinic Orbit 


and 
(7.108) U_(—00) = ug, v_(+00) = ty. 


Setting w.(7) = v)(7), we get a (2) x (2L) first-order system for V. = (ve, we): 


(7.109) Vinj= U(r, V.(r)) = (we(r),€7*[A(ve) —- T]we(T)), 
with 
(7.110) V-(—o0) = (ue,0), Ve(+0o) = (uy, 0). 


The paper [DD] considered such solutions when (7.1) is a2 x 2 system, satisfying 


F F. 
oe 25 oF 26, Vue R?, 
Oug 


(7.111) - 


a condition that guarantees strict hyperbolicity. In particular, it is shown in [DD] 
that this viscosity method leads to a solution to the Riemann problem for all data 
(ug, Uy) whenever (7.1) is asymmetric hyperbolic 2 x 2 system, satisfying (7.111). 

We mention another “viscosity method” that has been applied to 2 x 2 systems 
of the form (7.14). Namely, for ¢ > 0, consider 


Ut — We = 0, 
(7.112) 
wr — K(v)2 = Vet. 
This comes via v = uz, Ww = uz, from the equation 


(7.113) iy —K (Up) — Ete, 
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which arises in the study of viscoelastic bars; see [Sh1] and [SI]. We look for 
a traveling wave solution U = (v,w) of the form U((x — st)/e), satisfying 
U(—co) = (vg, we), U(+00) = (v;, w,). Thus we require 


(7.114) 


hence 


(7.115) sv'(r) = K(v) — K(ve) — 8(v— ve); v(—00) = ve, v(-+00) = vp. 


For this to be possible, one requires that W(v) = K(v) — K(ve) — s?(v — ve) 
vanish at v = vu, as well as v = ve; this together with the first part of (7.114) 
constitutes precisely the Rankine-Hugoniot condition, that U = (vg, we) for 
x < st, (vp,w,) for 2 > st, t > 0, be a weak solution to (7.14). In addi- 
tion, in order to solve (7.115), one requires that ve be a source for the vector field 
(sgn s)W(v)0/dv on R, that v, be a sink, and that there be no other zeros of w(v) 
for v between ve and v,.. Thus we require 


K(v) = Kw) _ 


U — Ve 


> 0, 


for v between vz and v, < if s > 0, and the reverse inequality if s < 0. Note that 
this implies the admissibility condition (7.92)-(7.93), given that K(v,)—K (vz) = 
s*(u, — ve). See the exercises after § 8 for more on this viscosity method. 

There is a method for approximating a solution to (9.1) with general initial 
data, via solving a sequence of Riemann problems, called the Glimm scheme, after 
[G11], where it is used as a tool to establish the existence of global solutions for 
certain classes of initial-value problems. The method is the following: Divide the 
x-axis into intervals J, of length ¢. In each interval J_, pick a point x,,, at random, 
evaluate u(0,x,) = a,, and now consider the piecewise-constant initial data so 
obtained. Assuming, for example, that (8.1) is strictly hyperbolic and genuinely 
nonlinear, and |u(0,x)| < C, one can obtain for small h a weak solution u(t, x) to 
(8.1) on (t, 2) € [0,2] x R, consisting locally of solutions to Riemann problems; 
see Fig. 7.17. Now, pick a new sequence y, of random points in J,, evaluate 
u(h, yv) = bp, and repeat this construction to define u(t, x) for (t,x) € [h, 2h] x 
R. Continue. In [GI1] there are results giving conditions under which one has 
v = ven well defined for (t,x) € R* x R, and convergent to a weak solution as 
£ — 0, h = coé. Further results can be found in [GL, DiP1, Liu5]; see also the 
treatment in [Smo]. In §9 we will describe a different method, due to [DiP4], to 
establish global existence for a class of systems of conservation laws. 
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FIGURE 7.17 Setup for Glimm’s Scheme 


Exercises 


In Exercises 1-3, we consider some shock interaction problems for a system of the 
form (7.1). Assume (7.1) is a 2 x 2 sysyem, strictly hyperbolic and genuinely nonlinear. 
Assume ug and u,. are sufficiently close together. 

1. Suppose that, for t < to, wu takes three constant values, ue, Um, Ur, in regions separated 
by shocks of the opposite family, with shock speeds s,,s—. Assume the faster shock 
is to the left. Thus these shocks must intersect; say they do so at t = to (see Fig. 7.18). 
Show that the solution to the Riemann problem at t = to, with data we, ur, consists 
of two shocks, s_, s+, as depicted in Fig. 7.18. In particular, there are no rarefaction 
waves. 

2. Suppose that, for t < to, u takes on three constant values ue, Um, Ur, In regions sepa- 
rated by shocks of the same family, say s+, and assume that the left shock has higher 
speed than the right shock. Thus these two shocks must intersect; say they do so at 
t = to (see Fig. 7.19). Show that the solution to the Riemann problem at t = to, with 
data we, ur, consists of a shock of the same family as those that interacted, together 
(perhaps) with either a shock wave or a rarefaction wave of the other family. (Hint: 
Study Fig. 7.4.) 

If only the second possibility can occur when two shocks of the same family collide, 
the 2 x 2 system is said to satisfy the “shock interaction condition.” This condition was 
introduced by Glimm and Lax; see [GL]. 

3. Show that the shock interaction condition holds, at least for sufficiently weak shocks, 
provided that 2 = R? and, for each up € 2, the curves p1(ue;T) and y2(ue;T) are 
both strongly convex, as in Fig. 7.20. Here, y;(ue;7) is obtained by piecing together 
the rarefaction curve yj (ue; T) and the shock curve yj (ue; 7). (Hint: Show that if, for 


FIGURE 7.18 Situation for Exercise 1 
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FIGURE 7.19 Situation for Exercise 2 


FIGURE 7.21 Possible Attack on Exercise 3 


example, wm lies on the 2-shock curve from we, as in Fig. 7.21, then the 2-wave curve 
(p2(Um;*) = So U Ro is as pictured in that figure, as is the continuation of ~3 (ue; T) 
for T < 0. To do this, you will need to look at O2y; (ue; +0). See [SJ].) 

4. Strengthen Proposition 7.1 as follows. Under the hypotheses of that proposition: 
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Claim. Given uo € Q, there is a neighborhood O of uo such that if we, ur € O, then 
there is a weak solution to (7.1) with initial data u(0,x) = ue forx < 0, uy for 
x>0. 


What is the difference? Similarly strengthen Proposition 7.3. 

5. Consider shock wave solutions to the system produced in Exercise | of §5, namely, 
spherically symmetric shocks in compressible fluids. 

6. Show that a solution to the system (7.1) is given by 


(7.116) u(t, xz) = v(y(t,2x)), 
where y is real-valued, satisfying the scalar conservation law 


(7.117) ye + Aj(v(~)) ee = 0, 


for some j, and v’(s) is parallel to r;(v(s)), with Aj, rj as in (7.3). 

Such a solution is called a simple wave. Rarefaction waves are a special case, called 
centered simple waves. 

Considering (7.117), study the breakdown of simple waves. 


8. Entropy-flux pairs and Riemann invariants 


As in 87, we work with an L x ZL system of conservation laws in one space 
variable: 


(8.1) us + F(u)e =0, 


where u takes values in Q C R¥ and F : Q > R® is smooth. Thus smooth 
solutions also satisfy 


(8.2) up + A(u)ug =0, A(u) = DyF(u). 


As noted in § 7, if u(t, z) vanishes sufficiently rapidly as 2 — too, then 


(8.3) / u(t, x) dx € RY 


is independent of t; so each component of (8.3) is a conserved quantity. 
An entropy-flux pair is a pair of functions 


(8.4) 7g: 2 3R 
with the property that the equation (8.1) implies 


(8.5) muje+ (Ua = 0 
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as long as u is smooth. If there is such a pair, again given appropriate behavior as 
x — oo, we have 


d 
(8.6) 5 | n(utt.2)) dz = 0, 


(8.7) [nut a)) dx = I,,(u) 


is independent of t, hence is another conserved quantity, provided u(t,) is 
smooth. As we’ll see below, the situation is different for nonsmooth, weak solu- 
tions to (8.1). 

To produce a more operational characterization of entropy-flux pairs, apply the 
chain rule to the left side of (8.5), to get uw, - Vn(u) + uz - V¢(u), and substitute 
uz = —A(u)uz from (8.2), to get 


(8.8) n(w)e + G(U)e = te *(—A(u)'Vn(u) + Va(u)). 
Thus the condition for (7, q) to be an entropy-flux pair for (8.1) is that 
(8.9) A(u)'Vn(u) = Vq(u). 


Note that (8.9) consists of L equations in two unknowns. Thus it is overde- 
termined if L > 3. For L > 3, some special structure is usually required 
to produce nontrivial entropy-flux pairs. For example, if A(u) is symmetric, 
so OF; /Oux = OF, /Ou;, and if Q C IR is simply connected, we can set 
Fy(u) = Og/Ouc. In such a case, 


B10 nw=Z 08, aly) = ey Flw) - ole, 


is seen to define an entropy-flux pair. Note that in this case 77 is a strictly convex 
function of w. 

If L = 2, then (8.9) is a system of two equations in two unknowns. We can 
convert it to a single equation for 7 as follows (assuming (2 is simply connected). 
The condition that A;,;(u) 0n/Ou, be a gradient field is that 


(8.11) jon (Aug) = Jay (Ault): 


for all 7, 2. We use the summation convention and hence sum over k in (8.11). We 
need verify (8.11) only for 7 < @, hence for 7 = 1, ¢ = 2, if L = 2. Carrying out 
the differentiation, we can write (8.11) as 
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m mm an a 
(8.12) [5 Aj (u) — 5; Ane(u)] aon 0, Vj<2 
In case L = 2, this becomes the single equation 
0?n O7n 0?n 
(8.13) Bulwae + PBB) san + Boal) 2 = 0, 
with 
OF 
Byi(u) = —Aja(u) = a 
U2 
OF: 
(8.14) Boo(u) = Aoi (u) = aa 
U1 
OF OF: 
2By2(u) = Aii(u) — Aro(u) = a - Fur’ 


Lemma 8.1. [f (8.2) is a2 x 2 system, then (8.13) is a linear hyperbolic equation 
for 7 if and only if (8.2) is strictly hyperbolic. 


Proof. The equation (8.13) is hyperbolic if and only if the matrix B(u) = 
(Bjx(w)) has negative determinant. We have 


det B(u) = —A12Aa1 — 7(An — Ago)” 
= ~7(4d, — 2A11Ag2 + AB, + 4412A01). 
Meanwhile, 
det(A(u) — A) = A? — (Ai + Ao2)A + Art Aaa — Ai2Aar, 
so A(w) has two real and distinct eigenvalues if and only if 
(Ay + Age)? — 4(A11- Age — A1gAo1) > 0. 


This last quantity is seen to be equal to —4 det B(u), so the lemma is proved. 


We will be particularly interested in producing entropy-flux pairs (7, q) such 
that 7 is convex. The reason for doing so is explained by the following result, 
which extends (6.21)—(6.24). 


Proposition 8.2. Consider solutions uz of 
(8.15) Oru, + A(uic)Ozue = €O2u,-, € > 0. 


Suppose that, as € \, 0, ue converges boundedly a.e. to u, a weak solution of 
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(8.16) Ou + Or F(u) = 0. 


Tf (1, q) is an entropy-flux pair and n is convex, then 


(8.17) muje + a(u)a <9, 
in the sense that this is a nonpositive measure. 


Here, if F',7, and q are defined on an open set 2 C RY”, we assume u-(t, 7) € 
kod. 


Proof. Take the dot product of (8.15) with Vn(u-) to get 


(8.18) An(ue) + Axg(ue) = EVN (Ue) » Pte. 

Use the identity 

(8.19) n(v)ce = Vn(v) - vee + Danilo) )(020;)(OxVk),  Nyx(v) = eum 
UROV; 

to get 

20) Muede + a(te)e = en(te)aw — 2 —€ > Njk(te)(Oetjc)(Oxtke) 


< eed 


by convexity of 7. Now passing to the limit ¢ + 0 gives 


(8.21) Nuc) + n(u), q(uz) + q(u), 


boundedly and a.e., hence weak* in L°°, while the right side of (8.20) tends to 0 
in the distributional topology. This yields (8.17). 


The inequality (8.17) is called an entropy condition. 

Suppose wu is a weak solution to (8.1) which is smooth on a region 0 C R? 
except for a simple jump across a curve y C O. If (7, q) is an entropy-flux pair, 
then 7(u): + q(u)2 = 0 on O \ +. Suppose (8.17) holds for uw. Then the negative 
measure 7)(u); + g(u), = —y is supported on 7; in fact, for continuous y with 
compact support in O, 


(8.22) — f eau= f (stn - ta)e ao, 


where [7] and [q] are the jumps of 7 and q across ¥, in the direction of increasing t; 
s = dx/dt on7 is the shock speed; and do is the arclength along y. Consequently, 
such an entropy-satisfying weak solution of (8.1) has the property 
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(8.23) sin] —[q <0 on 4. 


We remarked in § 7 that if wp and wu, are close and the Riemann problem (7.17) 
has a solution consisting of a j-shock, satisfying the Lax shock condition (7.52), 
then ue and wu, are connected by a viscous profile; we sketched a proof for 2 x 2 
systems. It follows from Proposition 8.2 that such solutions satisfy the entropy 
condition (8.17), for all convex entropies. 

We give some explicit examples of entropy-flux pairs. First consider the system 
(7.14), namely, 


V_ — We = 0,7 


(8.24) a — KW), <0, 


for which A+ and r+ are given by (7.16). In this case, one can use 


(8.25) n(v,w) = we + [ K(s) ds, q(v,w) = —wK(v). 


Note that 77 is strongly convex as long as K’(v) > 0. 
For the equation (7.5) of isentropic compressible fluid flow, we can set 


1 Pf 
8.26 fey) =r’ + X10), X"0) = [PO as, 


which is the total energy, with flux 


1 
(8.27) a(v,p) = (500 + X'(o)p)e. 


In the (p, m)-coordinates used to express the PDE in conservation form (7.8), we 
have 


(8.28) nlp,m) = + X(p). 


In this case 
if m? m 
(8.29) Dn = @ + ps *) ice ae +0" 4) 
m p ? 


so 7(p,m) is strongly convex as long as p’(p) > 0. 

We aim to present a construction of P. Lax of a large family of entropy-flux 
pairs, for 2 x 2 systems. In order to do this, and also for further analysis in § 9, it 
is useful to introduce the concept of a Riemann invariant. If A(w) = D, F'(u) has 
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eigenvalues and eigenvectors \;(w), 7;(w), as in (7.3), we say a smooth function 
€:Q — Risa k-Riemann invariant provided r; - Vé = 0. 


ana 


(8 


In the case of a system of the form (7.14), where r= = (1,—A4)’, AZ = 
.(v), we see that Riemann invariants are constant on integral curves of 0/Ov — 
.(v) 0/Ow, that is, curves satisfying dv/dw = —1/A+(v), so we can take 


) 


30) E(w) aw f Aste is=wa [ SRG) a 


Also, any functions of these are Riemann invariants. 


In the case of the system (7.8) for compressible fluids (in (p,m) coordinates), 


where we have r+ = (1, A+)’, Az = m/p+ v/p'(p), the Riemann invariants are 
constant on integral curves of 0/0p + A+0/Om (i.e., curves satisfying dm/dp = 


m 


/p+/p'(p)). If we switch to (p, v)-coordinates, with v = m/p, then dm/dp = 


pdv/dp + v, so these level curves satisfy pdu/dp = +,/p'(p). Hence we can 
take 


(8 


p / 9 
ay gal) sue f YEE) asa ve PAT porn, 
Po = y-1 


the latter identity holding when p(p) = Ap7, with y > 1, and we take po = 0. 


The following is a useful characterization of Riemann invariants. 


Proposition 8.3. Suppose that (8.1) is a strictly hyperbolic 2 x 2 system and that 


Q. has a coordinate system (&1, €), such that €} is a k-Riemann invariant. Then, 
fork = 1,2, 
(8.32) A(u)'VEx(u) = Aj(u)VEx(u), J #. 


Conversely, for 7 = 1,2, 


(8 


33) A(u)*VE(u) = Aj(u)VE(u) => E is a k-Riemann invariant, k # j. 


Proof. Since {Vé1(w), Véo(u)} is a basis of R? for each u € 2, we see that 


(8 


34) rr(u) - C(u) = 0 => C(u) = a(u)VEx(u), 


for some scalar a(u). Meanwhile, 


(8 


35) A(u)*C(u) = Aj(u)C(u) => re A(u)’C = Ayr: C. 


Since also rg - A(u)’¢ = ¢- A(u)ra = AnTR: ¢ and A; A Ax, we see that 
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(8.36) 


the last implication by (8.34). However, since A(u)' does have a nonzero Aj- 
eigenspace, this yields (8.32). It also establishes the converse, (8.33). 


Proposition 8.3 has the following consequence: 


Proposition 8.4. Suppose that (8.1) is a strictly hyperbolic 2 x 2 system and that 
Q. has a coordinate system &;,, k = 1,2, consisting of k-Riemann invariants. If u 
is a Lipschitz solution of (8.1), then 


Or€1(u) + A2(u)OrE1(u) = 0, 


(8.37) Or€2(u) + A1(U)OzE2(u) = 0. 


Proof. For 7 4 k, we have 
OE; (U) + An (U)Onkj(u) = Oru VEi(U) + An(u)Onw - VEj(u) 


(8.38) = Ou VEj(u) + Apu A(u)'VE;(u) 
= (Qu + A(u)d,u) - VE;(u), 


the second identity by (8.32). This proves (8.37). 


Following [L4], we now present a geometrical-optics-type construction of 
solutions to (8.9), for certain 2 x 2 systems, which yields convex entropy functions 
in favorable circumstances. We look for solutions of the form 


n= e8? (no + kim +++ + kN ny + fin), 


(8.39) . 
FP (gy +k-1g, +++ + kN Gn + Gn), 


q=e 
where yp = (wu), nj = n;(u), 47 = 9;(u), & is a parameter that will be taken 
large, and we will have 7, ¢v = O(k~). In fact, plugging this ansatz into (8.9) 
and equating like powers of k, we obtain 

(8.40) WV PE = NoA(u)'Ve, 

and, forO<j< N-1, 

(8.41) ngj+iVe + Vqj = 7541 A(u)’ Ve + A(u)*'Vn;. 


If yo # 0, (8.40) says 


(8.42) A(u)'Vye = @ Vy, 
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so qo/No is an eigenvalue of A(u)’ and Vy an associated eigenvector. By 


Proposition 8.3, the equation (8.40) holds provided we take 

(8.43) go =Aem, YP=kke, KE, 

where Xz is one of the two eigenvalues of A(u) and &;, is a k-Riemann invariant. 
We have solved the eikonal equation for y. For definiteness, let us take & = 1, 
; oe than tackle (8.41) directly, let us note that (8.9) is equivalent to 

(8.44) Ruq=AvRvn, Rv=r-V, v=i,2. 

Thus we can rewrite (8.40) and (8.41) as 

(8.45) phy =nrA Ry, v=1,2, 

and, forO <j < N—-1, 

(8.46) q+Rly + Rig = nA Re + ARLYN, vv =1,2. 

Clearly, (8.43) yields ~, qo, 70 satisfying (8.45). We have 

(8.47) Rig=0, Rop = Ro. 

Thus (8.46) takes the form 

(8.48) = Rigg =ArRinj, (jt — A2Nj+1)(Re€k) = A2Ranj — Rag;- 

For j = 0, using go = A270, we obtain the transport equation 


Ryr2 
8.49 R =0 
(8.49) Mo + ae ee 


which is an ODE along each integral curve of R,. This specifies 79, given initial 
data on a curve transverse to /;, and then qo is specified by (8.43). We can arrange 
that 79 > 0. Note that this specification of 79, go is independent of the choice of 
1-Riemann invariant y = €,. 

Similarly the higher transport equations (i.e., (8.48) for 7 > 1), give 7; and q;, 
for 7 => 1. Compare the geometrical optics construction in § 6 of Chap. 6. Once 
the transport equations have been solved to high order, one is left with a nonho- 
mogeneous, linear hyperbolic system to solve, to obtain exact solutions (1, q) to 
(8.9). 

It is also useful to write the transport equation (8.46) using the Riemann invari- 
ants (£1, €2) as coordinates on Q, if that can be done. We obtain for 7 = (£1, €2) 
and q = q(1, €2) the system 
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On Oq 
8.50 = ; = , 
les 86, Oe? BE, OE 


equivalent to (8.9) and to (8.44), and, if ~ = &1, then (8.46) becomes 


Oq; On; 
8.51 — =r, 44 — Ao = Ao -— —. 
Gy Be; BG, Ee Be, 


The equation (8.49) takes the form 


Ono 1 OrA2 


8.52 + = 
ae ie Apa Oe 


0. 


As stated in (8.43), we have qg = A279. We also record one implication of (8.51) 
for 1, 1: 


Od2 
8.53 — Aom = —-—= "0. 
(8.53) a — A2m DE, No 
In particular, if 79 > 0, then gi — A271 has the opposite sign to OA2/OE, (if this 
is nonvanishing, which is the case if (8.1) is genuinely nonlinear). 
Since it is of interest to have convex entropies 7, we make note of the following 
result, whose proof involves a straightforward calculation: 


Proposition 8.5. [f (k) is given by (8.39), with no > 0 on Q, then, for k suf- 
ficiently large and positive, n(k) is strongly convex on any given Q) CC Q, 
provided Vp # 0 on Q and, at any point ug € Q, if V = a, 0/Ou, + a2 O/Ouz, 
is a unit vector orthogonal to Vp(uo), then 


(8.54) V7(uo) > 0. 


If y satisfies the hypotheses of Proposition 8.5, we say ¢ is (strongly) quasi- 
convex. Clearly, (8.54) implies that a tangent line to {y = c} at uo lies in {y > c} 
on a punctured neighborhood of ug. Equivalently, y is quasi-convex on 2 C R? if 
and only if the curvature vector of each level curve {yy = c} at any point uo € 2 is 
antiparallel to the vector Vy(uo). Note that if 2 is convex and y is quasi-convex 
on 22, then each region {iy < c} is convex. 

Thus a favorable situation for exploiting the construction (8.39) to obtain a 
strongly convex entropy is one where 2. has a coordinate system (£1, £2) consist- 
ing of quasi-convex Riemann invariants. Note that if this is the case, we can form 
Ga eS; for some large constant \, and obtain a coordinate system consisting 
of strongly convex Riemann invariants. 

Consider the Riemann invariants €+ of (8.30), for the system (7.14), containing 
models of elasticity. We see that £,; and —f_ are quasi-convex, where K”(v) > 0, 
and that —€, and €_ are quasi-convex, where K”’(v) < 0, granted that K’(v) is 
nowhere vanishing. As for the Riemann invariants (8.31) for the system (7.7) of 
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compressible fluid flow, with variables (v, o), given 1 < y < 3 we have €, and 
—€_ quasi-convex on {(v, p) : p > O}. 

We end this section with the remark that the proof of Proposition 8.4 provides 
just a taste of the use of geometrical optics in nonlinear PDE, extending such 
developments of geometrical optics for linear PDE as discussed in Chap. 6. For 
further results on nonlinear geometrical optics, one can consult [JMR] and [Kic], 
and references given therein. In particular, [Kic] describes how constructions of 
nonlinear geometrical optics lead to such “soliton equations” as the Korteweg— 
deVries equation, the sine-Gordon equation, and the “nonlinear Schrédinger 
equation.” Studies of propagation of weak singularities of solutions to nonlin- 
ear equations, initiated in [Bon] and [RR], have also been pursued in a number of 
papers. Expositions of some of these results are given in [Bea, H, Tay]. 


Exercises 


1. Assume (7, ¢) is an entropy-flux pair for (8.1), and fix uo € Q. Show that 


7(u) = n(u) — (uo) — (u— wo) - Vn(uo), 


(8.55) Gu) = q(u) — q(uo) — (F(u) — F(uo)) - Vn(uo) 


is also an entropy-flux pair. Note that if 7 is strictly convex, then 7(u) > 0, and it 
vanishes if and only if u = uo. 


Exercises 2-4 involve an L x L system of conservation laws in n space variables: 


(8.56) ur + 5-0; F;(u) = 0, 


where F; : Q > R®, Q open in R”. An entropy-flux pair is a pair of functions 
7: QR, g: QR", 
satisfying 
(8.57) A;(u)'Vn(u) = Vaj(u), 1<i<n, 


where A;(u) = DuF;(u),q(u) = (q(u),...,qn(u)). This material is from [FL2]. 
2. Show that if (8.57) holds, then any smooth solution to (8.56) also satisfies 


n(u)e + 2 Ojq;(u) = 0. 


3. Show that if each A;(w) is a symmetric L x DL matrix, then an entropy-flux pair is 
given by 
1 
nu) = slur’, qj(u) = S> weFie(u) — 9; (u), 
e 


where F;(u) = (Fji(u),..., Fy (u)), Fye(u) = 09; /Oue. 
4. Show that if there is an entropy-flux pair (7, q) such that 7) is strongly convex, then the 
positive-definite, L x L matrix (0? /OujOu,) is a symmetrizer for (8.56). 
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In Exercises 5-7, let wu: = (ve, we) be smooth solutions to 


O1Vve — Or We = 0, 


8.58 
eae) Opwe — Or K (ve) = cOzO¢Ve, 


fort > 0. Assume € > 0. 
5. If either x € S* or the functions in (8.58) decrease fast enough as |x| — oo, show that 


satisfies 


6. If (7, q) is an entropy-flux pair for 


8.59 
( ) wr— K(v)2 =0 
show that 
Ovn(ue) + Oxrg(ue) = cA" (ue) Oo we. 
If 
On =, 
(8.60) Dy (ue? = C (we), 


and ¢ is convex (which holds for (7, g) given by (8.25)), deduce that 


Ain(ue) + Oxq(ue) < €O2C(we). 


7. Now suppose that, as ¢ \, 0, we converges boundedly to u = (v, w), a weak solution 
to (8.59). If (7, q) is an entropy-flux pair and (8.60) holds, with ¢ convex, deduce that 


(8.61) n(u)e+ (we <0, 


in the same sense as (8.17). Taking (7, q) as in (8.25), deduce that if u has a jump across 
y, as in (8.23), then 


K (ve) (vr — ve) < i. K(o) do — 5 (tr _ we)’, 


K (ve) vr — ve) < [" K (0) dor + 5 (we ~ we)? 


ve 
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Here we establish existence, for all t > 0, of entropy-satisfying weak solutions to 
a class of 2 x 2 systems of conservation laws in one space variable: 


(9.1) u+ F(u)e =0, u(0,x) = f(a). 


We will take x € S' = R/Z; modifications for x € R are not difficult. We assume 
f takes values in a certain convex open set 2 C R? and F' : Q + R? is smooth. 
As before, we set A(u) = DF'(u), a2 x 2 matrix-valued function of u. We assume 
strict hyperbolicity; namely, A(w) has real, distinct eigenvalues \;(u) < A2(u), 
with associated eigenvectors r1(u), 72(u). We will assume that Q has a global 
coordinate system (£1, £2), where €; € C°°(Q) is a j-Riemann invariant. In fact, 
we assume that € maps (2 diffeomorphically onto a region 


R={E: At <& < By, Ao < &2 < By}, 
where —oo < A; < Bj < +00. The assumptions stated in this paragraph will be 


called the “standard hypotheses” on (9.1). 
We will obtain a solution to (9.1) as a limit of solutions to 


(9.2) O,U, + 0,F (ue) = €0?u,, u-(0,2) = f(z). 
Methods of Chap. 15, § | (particularly Proposition 1.3 there), yield, for any « > 0, 
a solution u(t), defined for 0 < t < T(e), given any f € L°(S'), taking values 


in a compact subset of 0. The solution is C°° on (0, T(€)) x S* and continues as 
long as we have 


(9.3) u-(t,z) € K, 


for some compact K C (2. For now we make the hypothesis that (9.3) holds, for 
all t > 0. We also have the identity 


t 
(9.4) llue(t)|lZ2 +ef ||Axue(s)|I72 ds = || f\lZ2- 


To study the behavior of the solutions uz to (9.2) as ¢ — O, we use the 
theory of Young measures, developed in § 11 of Chap. 13. By Proposition 11.3 
of Chap. 13, there exists a sequence u; = u-,, with e; — 0, and an element 
(u, A) € Y(IRt x S') such that 


(9.5) uj + (u,A) in Y(R* x S$"). 


By Proposition 11.1 and Corollary 11.2 of Chap. 13, 
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(9.6) F(u;) + F weak* in L®(R* x S*), 

where 

(9.7) F(z) = | F(y) dvzc(y), ae. (t,2) € Rt x S’. 
R2 


Since €02u; — 0 in D’'(R* x S'), this implies 
(9.8) Ou+ O,F = 0. 


To conclude that u is a weak solution to (9.1), we need to show that F = F(u), 
which will follow if we can show that the convergence u; + (u, A) in Y° (Rt x 
B) is sharp (i.e., AX = Yy), or equivalently, that A; is a point mass on R?, for 
almost every (t,x) € R* x S?. 

Following [DiP4] we use entropy-flux pairs as a tool for examining 4, in a 
chain of reasoning parallel to, but somewhat more elaborate than, that used to 
treat the scalar case in § 11 of Chap. 13. 

For any smooth entropy-flux pair (7, q), we have 


(9.9) A:n(Ue) + Ang(ue) = €02N(Uc) — EOnte - 1" (Ue) One, 


where 1’(u-) is the 2 x 2 Hessian matrix of second-order partial derivatives of 17. 
We have the identity 


(9.10) ff aeuen"(u) da dx dt = [ult@) as— f n(ue(T2)) dx. 


We rewrite (9.9) as 


(9.11) Ien(ue) + Orq(te) = €0;N(Ue) — Re, 
with 
(9.12) R- bounded in L'(R* x S"). 


If 7 is convex, this follows directly from (9.10), since then the left side of (9.10) 
is the integral of a positive quantity. But even if 7 is not assumed to be convex, 
we can appeal to (9.4) to say \/£0,,u- is bounded in L?(IRt+ x S*), and this plus 
(9.3) implies (9.12). 

Since 0,7(uz) = 7 (ue) OrUe, we also deduce from (9.3)-(9.4) that the quan- 
tity \/€0,n(u-) is bounded in L?(IR*t x $+). Hence 


(9.13) c0°n(u-) +0 in H71(Rt x $1), as e 30. 
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Now we can apply Lemma 12.6 of Chap. 13 (Murat’s lemma) to deduce from 
(9.11)-(9.13) that 


(9.14) Oyn(ue) + Orq(ue) is precompact in H,)(R* x S$"). 


Now, let (71, q1) and (72, q2) be any two entropy-flux pairs, and consider the 
vector-valued functions 


(9.15) vj = (m(uy),q(uz)), wy = (a2(uy), —n2(us)), 
where w, is as in (9.5). By (9.14), we have 
(9.16) div vj, rotw; precompactin H,,/(Rt x S'). 


Also the L® bound on u; implies that v; and w; are bounded in L°(R* x S*), 


and a fortiori in L7,,(R* x S1). Therefore, we can apply the div-curl lemma, 


either in the form developed in the exercises after § 8 of Chap.5 or in the form 
developed in the exercises after § 6 of Chap. 13. We have 


(9.17) vy: wj > u-w in D(R* XS"), v= (I,%), w = (GM). 
In view of the L°°-bounds, we hence have 


mi (uj )qa(uj) — no(uj qi (uj) — 192 — eG 


9.18 
i) weak* in L®(Rt x S'). 


Recall that we want to show that any measure vy = A;,~, arising in the disin- 
tegration of the measure A in (9.5), is supported at a point. We are assuming that 
there are global coordinates (£1, €2) on 2 consisting of Riemann invariants. Let 


(9.19) R={€:a; < &; Sap} 


be a minimal rectangle (in €-coordinates) containing the support of v. The follow- 
ing provides the key technical result: 


Lemma 9.1. Ifa; < iia then each closed vertical side of R must contain a point 
where 0X2/0€, = 0. 


Proof. We have from (9.18) that 
(9.20) (v,mg2 — 291) = (¥,m)(Y, 92) — (Y,N2)(Y, 91) 
for all entropy-flux pairs (7;,q;). Let (n(k),q(k)) be a family of entropy-flux 


pairs of the form (8.39), with k € R, |k| large, so that n(k) > 0. Thus, for |k| 
large, we can define a probability measure ju, by 
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(¥, nk) f) 
(v,n(k)) 


We can take a subsequence k,, —> +00 such that 


(9.21) (Hi f) = 


(9.22) Hk, > [bt , Lak, > BW , weak* in M(Q). 


In view of the exponential factor e*§+ in 7(k), it is clear that 


(9.23) supp uw C RO {& = a7}. 


Now set AZ = (uu, A2). We claim that 


(9.24) (y, q— Az) > (ue, q— A2n), 


for every entropy-flux pair (7,q). To establish this, use (9.20) with (m,q1) = 
(7, q) and (72, q2) = (n(k), 4 Ne We get 


(v, na(k) — n(k)4q) (v, q(k)) 
(v, n(k)) (v,n(k)) 


Since, by (8.43), gg = A270 in the expansion (8.39), we have 


(v,q(k)) _ (v, Agn(k)) 
(v,n(k)) (uv, n(k)) 


Now, letting & = +k, and passing to the limit yield (*,A2) = AZ for (9.26). 
Similarly, 


(9.25) — (Vv, n) Ww (Vv, qd). 


(9.26) + O(k~*) = (ue, A2) + O(K7"). 


at en — (u*, r20), 


(9.27) 


so (9.25) yields (9.24) in the limit. 


ai use (9.20) with (m,q1) = (n(k), a(k)), (N2,q2) = (n(—k),q(—k)). 


(9.28) (v,m(k)g(-k) — n(=k)alk)) _ (vg(=k)) _ (uv, a(k)) 


(v,n(k))(v,m(—k)) (v,m(—k)) — (vn(k)) 


The right side converges to Ay — AZ as k = ky, —> +00. Meanwhile, note that 
n(k)q(—k) — n(—k)q(k) = O(k71). Also (v,n(k))(v,(—k)) — +00, faster 
than ek(ay —ay ~e) by the definition of R, ifay < ar Thus the left side of (9.28) 
tends to zero. We deduce that 


(9.29) A Ne. 
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The identities (9.24) and (9.29) imply that 


(9.30) (ut,q—Aegn) = (u,qg— Aen), 


for every entropy-flux pair (7), q). Now with (7, q) = (n(k), q(k)), we have 


(9.31) (u*,q— Aan) = et (u®, (qr — dom) ko? + O(k-2)), 


where 7k~! and q,k~! are the second terms in the expansion (8.39). If aj < af, 
the identity of these two expressions forces (u*,q1 — A2m) = 0. By (8.53), this 
implies 


(9.32) (iu. m52) =0 


Since y:* are probability measures and 7) > 0, this forces O\2/0E1 to change 
sign on supp pu, proving the lemma. 


Corollary 9.2. If (9.1) is genuinely nonlinear, so 01 /0€2 and 0X2 /0&, are both 
nowhere vanishing, then v is supported at a point. 


We therefore have the following result: 


Theorem 9.3. Assume that (9.1) satisfies the standard hypotheses and that solu- 
tions Uz to (9.2) satisfy (9.3). If (9.1) is genuinely nonlinear, then there is a 
sequence Ue, —> u, converging boundedly and pointwise a.e., such that u solves 
(9.1). Also, u satisfies the entropy inequality O.n(u) + Ozq(u) < 0, for every 
entropy-flux pair (n,q) such that n is convex (on a neighborhood of K). 


Certain cases of (9.1) that satisfy the standard hypotheses but for which gen- 
uine nonlinearity fails, not everywhere on Q, but just on a curve, are amenable to 
treatment via the following extension of Lemma 9.1: 


Lemma 9.4. If both characteristic fields of (9.1) are genuinely nonlinear outside 
a curve €2 = b(&1), with w strictly monotone, then v is supported at a point. 


Proof. By Lemma 9.1, each closed side of the rectangle R must intersect this 
curve, so it must go through a pair of opposite vertices of R; call them P and Q. 
By (9.32), we see that j:* and jz~ must be supported at these points. Thus (9.24) 
and (9.29) imply that 


(9.33) qQ) — A2(Q)n(Q) = q(P) — A2(P)n(P). 


We have the same sort of identity with A2 replaced by 1, so 


(9.34) [A2(Q) — A1(Q)]n(Q) = [A2(P) — Ar (P)] n(P), 
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for every entropy 7. In particular, we can take n(u) = uj; — Q;, 7 = 1,2, to 
deduce from (9.34) that P = Q, since the strict hyperbolicity hypothesis implies 
A2(P) — A1(P) # 0. This implies R is a point, so the lemma is proved. 


For an example of when this applies, consider the system (7.14), namely, 


V_ — We = O~* 


(9.35) ree een 


which, by (7.16), is strictly hyperbolic provided K’(v) 4 0. By (7.55), 


1 
(9.36) re -VA = +5 Ky? Kw, 


so we have genuine nonlinearity provided K”(v) ~ 0. However, in cases 
modeling the transverse vibrations of a string, by (7.12), we might have, for 
example, 


(9.37) K(v) =v+av’, 


for some positive constant a. Then K’(v) = 1 + 3av? > 0, but K”’(v) = 6av 
vanishes, at v = 0. In this case, Riemann invariants are given by (8.30), that is, 


(9.38) a ' | . /K"(s) ds, 


I 
g 


so genuine nonlinearity fails on the line £, = €_ (ie., 2 = €). Thus Lemma 9.4 
applies in this case. 

To make use of Theorem 9.3 and the analogous consequence of Lemma 9.4, 
we need to verify (9.3). The following result of [CCS] is sometimes useful for 
this: 


Proposition 9.5. Let O C 2 C R? be a compact, convex region whose boundary 
consists of a finite number of level curves y; of Riemann invariants, &;, such that 
V€; points away from O on y;; more precisely, 


(9.39) (u—y)-Vé(u) >0, for ver, yEO. 


If f € L©(S") and f(x) € K CC O forall x € S", then, for any < > 0, the 
solution to 


(9.40) Opte + Op F (ue) = €072Ue, ue(0,2) = f(z) 


exists on [0,00) x $1, and u,(t, x) € O. 


Proof. We remark that it suffices to prove the result under the further hypothesis 
that f € C™(S"). First, for any 6 > 0, consider 
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(9.41) Opts + Op F (ues) = €O72Ucs — SV p(ues), tes(0, 2) = f(x), 


where we pick y € © and take p(u) = |u — y|?. This has a unique local solution. 
If we show that uzs(t, 2) € O, for all (t, x), then it has a solution on [0, 00) x S?. 

If it is not true that ues(t, 2) € O for all (¢, x), there is a first tp > O such that, 
for some ro € S!, u(to, 20) € OO. Say u(to, xo) lies on the level curve 7;. Take 
the dot product of (9.41) with VE; (ues), to get (via (8.32)) 


On; (ues) oF Ag (tes) One; (ues) 


(9.42) ‘ "i 
= 6076; (es) — €(Oz U5) ° & (ues) OpUes — OVE; (ues) + Vp(ues). 


Our geometrical hypothesis on O implies 

(9.43) (Ones) * €j (tes )Ortcs > 0 and VE;(ues) - Ve(ues) > 0, 
at (to, Zo). Meanwhile, the characterization of (to, 7) implies 

(9.44) OnE; (ues) =0, and O2E;(ues) < 0, 


at (to, Zo). Plugging (9.43)-(9.44) into (9.42) yields 0,€; (ues) < 0 at (to, 2), an 
impossibility. Thus u-3 € O for all (t,x) € [0,00) x $1. 

Now, if (9.40) has a solution on [0, 7) x S1, analysis of the nonhomogeneous 
linear parabolic equation satisfied by wes = uz — Ues shows that ues > Ue on 
(0,7) x S1, as 5 — 0, so it follows that u-(t, 2) € O, and hence that (9.40) has a 
global solution, as asserted. 


As an example of a case to which Proposition 9.5 applies, consider the system 
(9.35), with K(v) given by (9.37), modeling transverse vibrations of a string. 
There are arbitrarily large, invariant regions O in Q = R? of the form depicted in 
Fig. 9.1. Here, OO = 71 U y2 U 73 U Ya, as depicted, and we take 


ecu [VRE as 


fi =€+onn, €3 = —€4 0n 3, 
€2 = €_ on 72, fa = —€_ on y4. 


(9.45) 


Another example, with Q = {(v,w) : 0 < uv < 1}, is depicted in Fig. 9.2. 
This applies also to the system (9.35), but with AK(v) given by (7.60). It models 
longitudinal waves in a string. In this case, there are invariant regions of the form 
O containing arbitrary compact subsets of 2. 

Since we have seen that Lemma 9.4 applies in these cases, it follows that 
the conclusion of Theorem 9.3, that is, the existence of global entropy-satisfying 
solutions, holds, given initial data with range in any compact subset of (2. 
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—, =const E_=const 


FIGURE 9.1 Setting for Proposition 9.5 


Exercises 


1. As one specific way to end the proof of Proposition 9.5, show that Wes¢ = Wes — Ueo 
satisfies 


OfWeso + Oz (GesoWesc) = 02 Wesa 7 dV p(tes) + oV p(tec), Wedo (0, x) = 0, 


where Gieso = ii DF (stes +(1-— 8)Ucc) ds. Deduce that there exists kK < oo such 
that, for e, 6,0 € (0, 1], 


K 
Gy llwese (t)llz2 < = llwese(t)llz2 + K(5 +0), 


FIGURE 9.2 Another Setting for Proposition 9.5 
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granted that wes(t, x), teo(t, x) € O, for all (t, x). Use Gronwall’s inequality to esti- 
mate ||wesa(t)||7.2, showing that, for fixed ¢ € (0, 1], it tends to zero as 6,0 — 0, 
locally uniformly in t € [0,00). Use this to show that ues > ue, as 6 > 0. 


10. Vibrating strings revisited 


As we have mentioned, the equation for a string vibrating in R* was derived in 
§ 1 of Chap. 2, from an action integral of the form 


(10.1) J(u) = /f[zlee? = F(ue(t,22))| de dt, 


IxXQ 


where « € 2 = [0,L], t € I = (to,t1). Assume that the mass density m is a 
positive constant. The stationary condition is 


(10.2) muy — F"(ur)« = 0, 
which is a second-order, k x k system. If we set 
(10.3) V=Uz, W= Ut, 
we get a first-order, (2h) x (2k) system: 


Ut — We = 0, 


(10.4) tie FG), 

where 

(10.5) K(v) = + pry). 
m 


Let us assume that F'(u,.) is a function of |u,.|? alone: 
(10.6) F(ue) = f(\uz|”). 


Then K(v) has the form 
2 / 2 
(10.7) K(v) = —f'(jol?)v. 
m 


We can write (10.4) in quasi-linear form as 


ve Bi @ - (bee i) te , 
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where, for b € R*, 


(10.9) DK(v)b= 2 P(uP)yb+ 4 Pr (ul2)(w- do, 
m m 

that is, 
2 if 2 4 i 2 2 

(10.10) DK(v) = —f'(lo[ 2+ — fF (lol) lvl Pe, 
m m 


P, being the orthogonal projection of R* onto the line spanned by v (if v ¥ 0). 
Writing (10.8) as 


(10.11) U, — A(U)Uz = 0, 


for U = (v,w)', we see that the eigenvalues of A(U) are given by 


(10.12) Spec A(u) = {4,/A; : Aj € Spec DK(v)}. 
Now, if k = 1, DK (v) is scalar: 


(10.13) DK(v) = =F"(v). 


m 


As long as F’’(v) > 0, the system (10.8) is strictly hyperbolic, with characteristic 


speeds 
/ 1 
At = +4/—F"(v). 
m 


In this case, the system (10.4) describes longitudinal vibrations of a string. The 
Riemann problem for this system was considered in (7.55)-(7.60), and DiPerna’s 


global existence theorem was applied in the discussion of Fig. 9.2. 
On the other hand, if k > 1, then 


(10.14) Spec DK(v) = {> F(wP), = Fol?) + =F" (HPO }, 


where the first listed eigenvalue has multiplicity k — 1 and the last one has mul- 


tiplicity 1. We can rewrite these eigenvalues, using the notion of “tension” T(r), 
defined so that 


(10.15) F'(v) = Tel) 


that is, so 2f’(|v|?) = T(|v|)/|v|. A calculation gives 
2 (lvl?) + 4F"(lul?)lel? = T'(lel), 


so 


10. Vibrating strings revisited 541 


f(r) 


0 a 


FIGURE 10.1 Graph of f 


T(r) 


FIGURE 10.2 Graph of T 


ie 1 
(10.16) Spec DK(v) = {2 ua Loup}. 


The basic expected behavior of the function f(r) is discussed in § 7, around the 
formula (7.60). The function f(r) should be expected to behave as in Fig. 10.1. 
For r larger than a certain a, f(r) should increase. On the other hand, also f(r) 
should get very large as r \, 0, since the material of the string would resist 
compression. This means for the tension T(r) that T(r) > 0 for r > a and 
T(r) <0 for 0 <r < a. On the other hand, we expect T(r) to increase whenever 
r increases, so T’(r) > 0 for all r. Such behavior of the tension is depicted in 
Fig. 10.2. 

We conclude that when |v| > a, then 


(10.17) Spec A(U) = {x2 tee Er (w)} 


m | 
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consists of real numbers. If k = 2, we have four eigenvalues. These are all distinct 
as long as f”’(|v|*) 4 0, that is, as long as the function f(r) is strongly convex. 
If this convexity fails somewhere on |v| > a, then the system (10.4) will not be 
strictly hyperbolic, but it will be symmetrizable hyperbolic, as long as |v| > a. 

On the other hand, when |v| < a, then Spec A(U), which is still given by 
(10.17), has two purely imaginary elements, as well as two real elements (the 
former being eigenvalues with multiplicity & — 1). Thus (10.4) is not hyperbolic 
in the region |v| < a. 

Let us concentrate for now on the region |v| > a, where (10.4) is hyperbolic, 
and examine whether it is genuinely nonlinear. We consider the case k = 2. Let 
us denote the two eigenvalues of DK(v) given by (10.16) by \,(v): 


T (ul) 


1 
m | 


(10.18) Ai(v) = 


Thus 


f’'(\vl?) > 0 => Aa(v) > Ar(v), 


(10.19) Rae 
Ff" (ule) < 0 => ra(v) < Ar(v). 


From (10.10) we see that we can take as eigenvectors of DK (v) 


(10.20) ro(v) =v, ri(v) = Jv, 


where J : R? — R? is counterclockwise rotation by 90°. It follows that eigen- 
vectors of A(U) corresponding to the eigenvalues 


(10.21) Mjt = £4/ A; (v) 


are given by 


(10.22) be =( ri(0) ). 


Thus 


(10.23) rye Vis = try(0) Vso) = ty rile) V0). 


Now, by (10.20), if we use polar coordinates (r, 9) on R?, with r?2 = v? + v3, 
then 


(10.24) Rg =r— 
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so 
1 
(10.25) r.-VAi(v) =0, ra: VAa(v) = —T” ({v)) 0. 
m 


Thus we see that within the hyperbolic region |v| > a, (10.4) has two lin- 
early degenerate fields and two fields that are genuinely nonlinear as long as 
T"(\v|) £0. 

We now describe an interesting complication that arises in the treatment of 
the system (10.4) when k > 2. Namely, even if initial data lie entirely in the 
hyperbolic region, the solution might not stay in the hyperbolic region. Consider 
the following simple example, with & = 2. We let v(0, x), w(0, x) be initial data 
for a purely longitudinal wave, so that the motion of the string is confined to the 
2x ,-axis. Thus, we take 


(10.26) v(0,2) = .. _) , w(0,2) = Guy 


For all t > 0, the solution will have the form 


(10.27) v(t, «) = Ge ”) , w(t,2) = = 


where the pair (v1, wi) satisfies the & = 1 case of (10.4). 

Suppose K’(v) is given by (10.7) and T(|v|) = 2f’(|v|*)|v| has the behavior 
indicated in Fig. 10.2; also, let us assume f is real analytic on (0,00). Introduce 
another variable 77, and let (ui(t, xin), wi(t, x, n)) solve the k = 1 case of (10.4), 
with initial data 


v1(0, 2,9) =a+n, 


10.28 
( ) w1(0, 2,7) = —b sina, 


where b is some positive constant. By the Cauchy—Kowalewsky theorem, there is 
a T > 0 such that there is a unique, real-analytic solution defined for |t| < T, for 
all « € R (periodic of period 27), and for all |7| < a/2. Note that 


(10.29) O01 (0,0, 7) = —b. 


It follows easily from the implicit function theorem that, for all 7 > 0 sufficiently 
small, there exists t(7) € (0,7) such that 


(10.30) v1(t(7),0,7) <a. 
This is a well-behaved solution to the longitudinal wave problem, but the solution 


so produced to the k = 2 case of (10.4), having the form (10.27), clearly has the 
property that 
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(10.31) (v(t(n),0,n), w(t(n),0,7))" = (vr(t(n), 0,7), 05 wr (t(m), 0, 0), 0)! 


does not belong to the hyperbolic region, despite the fact that the initial data do. 

Note that, for the system under consideration, solutions to the Riemann prob- 
lem for (v1, w1) have the behavior discussed in § 7, illustrated by Fig. 7.4 there. 
For example, the situation illustrated in Fig. 10.3 can arise. Here, we have Rie- 
mann data 


(10.32) Uye = (vie, wre)’, Uir = (vir, Wir)’, vie >a, Vir > a, 
but the intermediate state U;,,, has the form 
(10.33) Uim = (Yim; Wim)’, Vim <a. 

This is also a well-behaved solution to the longitudinal wave problem, but the 
solution so produced to the k = 2 case of (10.4) is the following. We have the 
Riemann problem 
(10.34) Ue = (v1, 0; we,0)',  Ur(vir, 0; wir, 0)", 
and a weak solution to (10.4), involving two jumps, with intermediate state 
(10.35) Cra = Chee, OPO)": 

Clearly, U,,, does not belong to the hyperbolic region for the k = 2 case of (10.4), 
even though U, and U,. do belong to the hyperbolic region. 


M. Shearer, [Sh1, Sh2] (see also [CRS]), has proposed a method for solv- 
ing Riemann problems for a class of systems, including the system for vibrating 


FIGURE 10.3 Connecting Ui, to Ui, 
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strings considered here, which leaves the hyperbolic region invariant. The solu- 
tion produced by this method to the Riemann problem described in the preceding 
paragraph has an intermediate state U,,, different from the one described above. 
The situation is depicted in Fig. 10.4. Here Uim, = (Vim, Wim) With Vim < 0 (in 
fact, Vim < —a). The physical interpretation is that the string develops a kink and 
doubles back on itself. For motion strictly confined to one dimension, this would 
not be allowable, as it involves interpenetration of different string segments. If 
the string has two or more dimensions in which to move, one thinks of these seg- 
ments as lying side by side; however, one does not ask which segment lies on 
which side; for example, for k = 2, one does not ask whether the configuration is 
as in Fig. 10.5A or as in Fig. 10.5B. 

Of course, there is only one real-analytic solution with the initial data (10.26)- 
(10.28); there is no real-analytic modification that stays in the hyperbolic region. 
In [PS] there is a study of the behavior of approximations of such solutions via 
Glimm’s scheme. It is found that typically these approximations do not converge, 
even weakly, to the smooth solution. 

Further material on systems that change type can be found in [KS]. Also, 
papers in [KK3] deal with systems for which strict hyperbolicity can fail, as can 
happen in the hyperbolic region for (10.4) if f’’(r) = 0 for some r > a?. 


FIGURE 10.4 More Complex Connection 
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i a 
(a) (b) 


FIGURE 10.5 Crinkled Strings 


Exercises 


1. Work out the equations for radially symmetric vibrations of a two-dimensional mem- 
brane in R?. Perform an analysis parallel to that done in this section for the vibrating 
string system. 
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Euler and Navier-Stokes Equations 
for Incompressible Fluids 


Introduction 


This chapter deals with equations describing motion of an incompressible fluid 
moving in a fixed compact space M, which it fills completely. We consider two 
types of fluid motion, with or without viscosity, and two types of compact space, 
a compact smooth Riemannian manifold with or without boundary. The two types 
of fluid motion are modeled by the Euler equation 


(0.1) au + Vuu = —grad p, divu = 0, 


for the velocity field u, in the absence of viscosity, and the Navier-Stokes equation 


3) 
(0.2) 7 +V,u=vLu—gradp, divu=0, 
in the presence of viscosity. In (0.2), v is a positive constant and £ is the second- 
order differential operator 


(0.3) Lu = div Def u, 


which on flat Euclidean space is equal to Au, when div u = 0. If there is a 
boundary, the Euler equation has boundary condition n - u = 0, that is, wu is 
tangent to the boundary, while for the Navier-Stokes equation one poses the no- 
slip boundary condition u = 0 on OM. 

In § 1 we derive (0.1) in several forms; we also derive the vorticity equation 
for the object that is curl « when dim M = 3. We discuss some of the classical 
physical interpretations of these equations, such as Kelvin’s circulation theorem 
and Helmholtz’ theorem on vortex tubes, and include in the exercises other topics, 
such as steady flows and Bernoulli’s law. These phenomena can be compared with 
analogues for compressible flow, discussed in § 6 of Chap. 16. 
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Sections 2-6 discuss the existence, uniqueness and regularity of solutions to 
(0.1) and (0.2), on regions with or without boundary. We have devoted separate 
sections to treatments first without boundary and then with boundary, for these 
equations, at a cost of a small amount of redundancy. By and large, different 
analytical problems are emphasized in the separate sections, and their division 
seems reasonable from a pedagogical point of view. Section 4 examines Euler 
flows on a rotating surface, which brings in a modification of (0.1), incorporating 
the Coriolis force. 

The treatments in §§ 2-6 are intended to parallel to a good degree the treatment 
of nonlinear parabolic and hyperbolic equations in Chaps. 15 and 16. Among the 
significant differences, there is the role of the vorticity equation, which leads to 
global solutions when dim M = 2. For dim M > 3, the question of whether 
smooth solutions exist for all ¢ > 0 is still open, with a few exceptions, such 
as small initial data for (0.2). These problems, as well as variants, such as free 
boundary problems for fluid flow, remain exciting and perplexing. 

In §7 we tackle the question of how solutions to the Navier-Stokes equations 
on a bounded region behave when the viscosity tends to zero. We stick to two spe- 
cial cases, in which this difficult question turns out to be somewhat tractable. The 
first is the class of 2D flows on a disk that are circularly symmetric. The second is 
a class of 3D circular pipe flows, whose detailed description can be found in § 7. 
These cases yield convergence of the velocity fields to the fields solving associ- 
ated Euler equations, though not in a particularly strong norm, due to boundary 
layer effects. Section 8 investigates how such velocity convergence yields infor- 
mation on the convergence of the flows generated by such time-varying vector 
fields. 

In Appendix A we discuss boundary regularity for the Stokes operator, needed 
for the analysis in § 6. 


1. Euler’s equations for ideal incompressible fluid flow 


An incompressible fluid flow on a region (2 defines a one-parameter family of 
volume-preserving diffeomorphisms 


(1.1) F(t,-):Q——3Q, 
where 2) is a Riemannian manifold with boundary; if OQ is nonempty, we suppose 
it is preserved under the flow. The flow can be described in terms of its velocity 


field 


(1.2) u(t,y) = Fi(t,z) €T,Q, y=F(t,2), 
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where F;(t,x) = (0/0t)F(t, x). If y € OQ, we assume u(t, y) is tangent to 
OQ. We want to derive Euler’s equation, a nonlinear PDE for u describing the 
dynamics of fluid flow. We will assume the fluid has uniform density. 

If we suppose there are no external forces acting on the fluid, the dynamics are 
determined by the constraint condition, that F’(t,-) preserve volume, or equiva- 
lently, that div u(t,-) = 0 for all t. The Lagrangian involves the kinetic energy 
alone, so we seek to find critical points of 


1 
(1.3) LPS (Fi(t,x), F(t, x)) dV dt, 


on the space of maps F : I x Q > Q (where I = [to,t1]), with the volume- 
preserving property. 

For simplicity, we first treat the case where (2 is a domain in R”. A variation 
of F is of the form F'(s,t,x), with OF /Os = v(t, F(t,x)), at s = 0, where div 
v = 0, v is tangent to OO, and v = 0 for t = to and t = t,. We have 


DL(F)u = [fae “u(t, F(t,2))) dV dt 
[feree), “v(t, Flt,2))) dV dt 
= - [[ (Gru veur) dV dt. 


The stationary condition is that this last integral vanish for all such v, and hence, 
for each t, 


(1.5) [ (Gre vou) dv =0, 


(1.4) 


Q 


for all vector fields v on 2 (tangent to OQ), satisfying div v = 0. 
To restate this as a differential equation, let 


(1.6) VYo= {ve C™(Q, TQ) : div v = 0, v tangent to an}, 


and let P denote the orthogonal projection of L?(Q, TQ) onto the closure of the 
space V,. The operator P is often called the Leray projection. The stationary 
condition becomes 


Ou 
(1.7) BE + P(u- Vu) = 0, 


in addition to the conditions 


(1.8) divu=0 onQ 
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and 
(1.9) u tangent to 0Q. 


For a general Riemannian manifold , one has a similar calculation, with 
u- Vu in (1.5) generalized simply to V,,u, where V is the Riemannian con- 
nection on 2. Thus (1.7) generalizes to 


Euler’s equation, first form. 


Ou 
(1.10) 5p t P(Vu) =0. 


Suppose now {2 is compact. According to the Hodge decomposition, the 
orthogonal complement in L?(Q, 7’) of the range of P is equal to the space 


{grad p: p € H'(Q)}. 


This fact is derived in the problem set following § 9 in Chap. 5, entitled “Exercises 
on spaces of gradient and divergence-free vector fields”; see (9.79)-(9.80). Thus 
we can rewrite (1.10) as 


Euler’s equation, second form. 


(1.11) oe + Yuu = ~grad p. 


Here, p is a scalar function, determined uniquely up to an additive constant 
(assuming 2) is connected). The function p is identified as “pressure.” 

It is useful to derive some other forms of Euler’s equation. In particular, let w 
denote the 1-form corresponding to the vector field u via the Riemannian metric 
on Q. Then (1.11) is equivalent to 


dit - 
(1.12) ay 1 Vuit = —ap. 


We will rewrite this using the Lie derivative. Recall that, for any vector field X, 
VuX =LyX+Vxu, 

by the zero-torsion condition on V. Using this, we deduce that 

(1.13) (Lyti — Vytt, X) = (4, Vxu). 

In case (ti, v) = (u,v) (the Riemannian inner product), we have 


(1.14) (a, Vu) = 5(alul?, X), 
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using the notation |u|? = (u, u), so (1.12) is equivalent to 


Euler’s equation, third form. 
(1.15) 2 + fyii = a(S !2 ) 
. ; uti = d\ 5 lu p}. 
Writing the Lie derivative in terms of exterior derivatives, we obtain 
ss ; -— 
(1.16) St (dal) ju = a(S Iu +p). 


Note also that the condition div u = 0 can be rewritten as 


(1.17) 6a = 0. 
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In the study of Euler’s equation, a major role is played by the vorticity, which 


we proceed to define. In its first form, the vorticity will be taken to be 


(1.18) i = di, 


for each t a 2-form on (Q. The Euler equation leads to a PDE for vorticity; indeed, 


applying the exterior derivative to (1.15) gives immediately the 


Vorticity equation, first form. 


Ow 
1.1 — p= 
(1.19) Bt + Lyw =0, 
or equivalently, from (1.16), 
Ow 7 
(1.20) 3 + d(w]u) =0. 


It is convenient to express this in terms of the covariant derivative. In analogy 


to (1.13), for any 2-form ( and vector fields X and Y, we have 


(Vu _ Ly B)(X, Y) = B(V xu, Y) + A(X, Vyu) 


1.21 
oa = (B#Vu)(X,Y), 


where the last identity defines #4Vu. Thus we can rewrite (1.19) as 


Vorticity equation, second form. 


(1.22) ac + Vt — w#Vu = 0. 
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It is also useful to consider vorticity in another form. Namely, to w we associate 
a section w of A"~?T (n = dim Q), so that the identity 


(1.23) wAa= (w,ayw 


holds, for every (n — 2)-form a, where w is the volume form on 2. (We assume 
Q. is oriented.) The correspondence w <> w given by (1.23) depends only on the 
volume element w. Hence 


(1.24) divu=0=> L,% = Lyw, 


so (1.19) yields the 


Vorticity equation, third form. 


(1.25) op t Lu = 0. 


This vorticity equation takes special forms in two and three dimensions, 
respectively. When dim 2 = n = 2, w is ascalar field, often denoted as 


(1.26) w= rotu, 
and (1.25) becomes the 
2-D vorticity equation. 


(1.27) Oe +a grad w= 0. 


This is a conservation law. As we will see, this has special implications for two- 
dimensional incompressible fluid flow. If n = 3, w is a vector field, denoted as 


(1.28) w= curl u, 


and (1.25) becomes the 


3-D vorticity equation. 


0 
(1.29) a + [u, w] =0, 
or equivalently, 
0 
(1.30) EV = Vat =O. 
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Note that (1.28) is a generalization of the notion of the curl of a vector field on 
flat R°. Compare with material in the second exercise set following § 8 in Chap. 5. 
The first form of the vorticity equation, (1.19), implies 


(1.31) (0) = (F)*w(t), 


where F'(x) = F(t,x), w(t)(x) = W(t, x). Similarly, the third form, (1.25), 
yields 


(1.32) w(t,y) =A" *DF*(z) w(0,2), y= Fit,2), 


where DF" (x) : T,Q — Ty{Q is the derivative. In case n = 2, this last identity is 
simply w(t, y) = w(0, 2), the conservation law mentioned after (1.27). 

One implication of (1.31) is the following. Let S be an oriented surface in 2, 
with boundary C; let S(t) be the image of S under F‘, and C(t) the image of C; 
then (1.31) yields 


(1.33) / ae [eo 


S(t) 3 


Since w = du, this implies the following: 


Kelvin’s circulation theorem. 


(1.34) / ee / ii(0). 


We take a look at some phenomena special to the case dim 2. = n = 3, where 
the vorticity w is a vector field on Q, for each t. Fix to, and consider w = w(to). 
Let S be an oriented surface in (2, transversal to w. A vortex tube 7 is defined 
to be the union of orbits of w through S, to a second transversal surface S2 (see 
Fig. 1.1). For simplicity we will assume that none of these orbits ends at a zero 
of the vorticity field, though more general cases can be handled by a limiting 
argument. 

Since dw = d?% = 0, we can use Stokes’ theorem to write 


Cc. Cc 
g 2 


FIGURE 1.1 Vortex Tube 
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(1.35) o= fao= fw. 
T 


oT 


Now OT consists of three pieces, S and S2 (with opposite orientations) and the 
lateral boundary £, the union of the orbits of w from OS to OS. Clearly, the 
pull-back of w to L is 0, so (1.35) implies 


(1.36) fo fo. 


Ss So 


Applying Stokes’ theorem again, for w = du, we have 


Helmholtz’ theorem. For any two curves C’, C2 enclosing a vortex tube, 


(1.37) fue fe 


Cc C2 


This common value is called the strength of the vortex tube T. 


Also note that if 7 is a vortex tube at to = 0, then, for each t, T(t), the 
image of 7 under F", is a vortex tube, as a consequence of (1.32) (with n = 3), 
and furthermore (1.34) implies that the strength of T(t) is independent of t. This 
conclusion is also part of Helmholtz’ theorem. 

To close this section, we note that the Euler equation for an ideal incompress- 
ible fluid flow with an external force f is 


) 
op t Vu = ~erad p+ f. 
in place of (1.11). If f is conservative, of the form f = — grad y, then (1.12) is 


replaced by 
Ou 
= wu = —d : 
a Vull (p+ ¢) 


Thus the vorticity w = du continues to satisfy (1.19), and other phenomena dis- 
cussed above can be treated in this extra generality. 

Indeed, in the case we have considered, of a completely confined, incompress- 
ible flow of a fluid of uniform density, adding such a conservative force field has 
no effect on the velocity field u, just on the pressure, though in other situations 
such a force field could have more pronounced effects. 


REMARK. The Lagrangian approach given in (1.3)—(1.8) can be seen as a deriva- 
tion of the Euler equations as equations for the geodesic flow on the group of 
diffeomorphisms of 2, an infinite-dimensional Lie group provided with a cer- 
tain Riemannian metric. This picture is emphasized in a number of publications, 
such as [Ar, EM] and [AK]. A number of other evolution equations, such as the 
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Korteweg-de Vries equation and the cubic nonlinear Schrddinger equation, can be 
derived as geodesic equations on other such Lie groups, and such derivations in 
turn point the way to important conservation laws. There are many publications 
on this. An exposition is given in [T4]. 


Exercises 


1. Using the divergence theorem, show that whenever div u = 0, wu tangent to OQ, 2 
compact, and f € C™°(Q), we have 


[orev =o. 
Q 


Hence show that, for any smooth vector field _X on Q, 


[(vox, X) dV =0. 
2 


From this, conclude that any (sufficiently smooth) w solving (1.7)—(1.9) satisfies the 
conservation of energy law 


d 
(1.38) allu(t)lizacay = 0. 


2. When dim 2 = n = 3, show that the vorticity field w is divergence free. 
(Aint: div curl.) 

3. If_u, v are vector fields, u the 1-form associated to u, it is generally true that Vyt = 
Vou, but not that £,u = Lyu. Why is that? 

4. A fluid flow is called stationary provided wu is independent of t. Establish Bernoulli’s 
law, that for a stationary solution of Euler’s equations (1.7)-(1.9), the function 
(1/2)|u| + p is constant along any streamline (i.e., an integral curve of w). 

In the nonstationary case, show that 


5 (2 — cu) Iu)? = Lup. 


2\Ot 
(Hint: Use Euler’s equation in the form (1.16); take the inner product of both sides 
with wu.) 
5. Suppose dim 2 = 3. Recall from the auxiliary exercise set after § 8 in Chap.5 the 
characterization 


uxv=X => X =«(G Nd). 
Show that the form (1.16) of Euler’s equation is equivalent to 


du 


(1.39) BE 


1 
+ (curl u) x w= —grad (5 Ju +p). 
Also, if 2 C R3, deduce this from (1.11) together with the identity 


grad(u-v) =u-Vu+vu-Vu+ux curlv+vx curlu, 
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which is derived in (8.63) of Chap. 5. 


6. Deduce the 3-D vorticity equation (1.30) by applying curl to both sides of (1.39) and 
using the identity 
curl(u x v) =v-Vu-—u- Vu (div v)u — (div u)v, 
which is derived in (8.62) of Chap.5. Also show that the vorticity equation can be 
written as 
1 
(1.40) wr + Vuw = (Def u)w, Def v = a (Ve + Vv’). 
(Hint: w x w = 0.) 

7. In the setting of Exercise 5, show that, for a stationary flow, (1/2)|u|? + p is constant 
along both any streamline and any vortex line (i.e., an integral curve of w = curl wu). 

8. For dim 2 = 3, note that (1.29) implies [u, w] = 0 for a stationary flow, with w = 
curl wu. What does Frobenius’s theorem imply about this? 

9. Suppose uw is a (sufficiently smooth) solution to the Euler equation (1.11), also satisfy- 
ing (1.9), namely, u is tangent to OQ. Show that if u(0) has vanishing divergence, then 
u(t) has vanishing divergence for all t. (Hint: Use the Hodge decomposition discussed 
between (1.10) and (1.11).) 

10. Suppose w, the 1-form associated to u, and a 2-form w satisfy the coupled system 

on + wlu = —d®, 

(1.41) _ 

ow a()u) =0 

Ot — 
Show that if w(0) = da(0), then w(t) = di(t) for all t. (Hint: Set W(t) = da(t), 
so by the first half of (1.41), OW/Ot + d(w|u) = 0. Subtract this from the second 
equation of the pair (1.41).) 

11. If w generates a 1-parameter group of isometries of 2, show that w provides a sta- 
tionary solution to the Euler equations. (Hint: Show that Def u = 0 > Vyu = 
(1/2) grad |u|?.) 

12. A flow is called irrotational if the vorticity wW vanishes. Show that if w(0) = 0, then 
w(t) = 0 for all ¢, under the hypotheses of this section. 

13. Ifa flow is both stationary and irrotational, show that Bernoulli’s law can be strength- 
ened to i 

5 |u|? + p is constant on 2. 

The common statement of this is that higher fluid velocity means lower pressure 

(within the limited set of circumstances for which this law holds). (Hint: Use (1.16).) 
14. Suppose © is compact. Show that the space of 1-forms &@ on 2 satisfying 

6u=0, du=O0onQ, (v,u) =0o0n dQ, 

is the finite-dimensional space of harmonic 1-forms H%', with absolute boundary con- 

ditions, figuring into the Hodge decomposition, introduced in (9.36) of Chap. 5. Show 

that, for %(0) € H#, Euler’s equation becomes the finite-dimensional system 

Ou ze 

(1.42) a + PA(V. i) =0, 
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where Pj is the orthogonal projection of L?(Q, At) onto H/. 


15. In the context of Exercise 14, show that an irrotational Euler flow must be sta- 
tionary, that is, the flow described by (1.42) is trivial. (Hint: By (1.16), 0u/Ot = 
—d((1/2)|u|* + p), which is orthogonal to Hi.) 

16. Suppose Q is a bounded region in R*, with k + 1 (smooth) boundary components 7;. 
Show that 1/4 is the k-dimensional space 

Hi = {«df : f € C~(Q), Af =00nQ, f = c; on 4;}, 
where the c; are arbitrary constants. Show that a holomorphic diffeomorphism F’ : 
= O takes Hi}(Q) to Hi*(O). 

17. If Q is a planar region as in Exercise 16, show that the space V, of velocity fields for 
Euler flows defined by (1.6) can be characterized as 

Vz ={u:t=x«df, f ¢ C°(Q), f =; on 75}. 
Given u in this space, an associated f is called a stream function. Show that it is 
constant along each streamline of wu. 

18. In the context of Exercise 17, note that w = rot u = —Af. Show that u- Vw = 0, 
hence Ow/Ot = 0, whenever f satisfies a PDE of the form 

(1.43) Af =®(f)onQ, f =c; 0n4;. 

19. When «(0) = df for f satisfying (1.43), show that the resulting flow is stationary, 
that is, Ou /Ot = 0, not merely Ow/Ot = 0. (Hint: In this case, & satisfies the linear 
evolution equation 

- + Pi(w * i) =0, 
as a consequence of (1.16). It suffices to show that P“(w * &(0)) = 0, but indeed 
w * a(0) = —®(f)df = deb(f).) 
Note: When Q2 is simply connected, the argument simplifies. 

20. Let 2 be a compact Riemannian manifold, wu a solution to (1.7)—-(1.9), with associated 

vorticity w. When does 
Ow Ou 
— =0, foralt —= — =0? 
at = at 
Begin by considering the following cases: 
(a) H'(Q) = 0. (Hint: Use Hodge theory.) 
(b) dim H? (Q) = 1. (Aint: Use conservation of energy.) 
(c) Q CC R?. (Hint: Generalize Exercise 19.) 

21. Using the exercises on “spaces of gradient and divergence-free vector fields” in § 9 of 
Chap. 5, show that if we identify vector fields and 1-forms, the Leray projection P is 
given by 

(1.44) Pu = P§u+ PAu, ie. (I- P)u = Pfu = dbG4u. 

22. Let 2 be a smooth, bounded region in R° and wu a solution to the Euler equation on 


I x Q, where I is a t-interval containing 0. Assume the vorticity w vanishes on OQ at 
t=0. 

(a) Show that w = 0 on OQ, for all t € I. 

(b) Show that the quantity 
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(1.45) h(t) = ic x)- w(t, x) dx, 


Q 


is independent of t. This is called the helicity. (Hint: Use formulas for the adjoint of 
Vx when div u = 0; ditto for V,,; recall Exercise 2.) 
(c) Show that the quantities 


(1.46) I(t) = Je x w(t,v) dx, A(t) = if \ar|? w(t, wv) dar 


2 2 


are independent of t. These are called the impulse and the angular impulse, 
respectively. 
Consider these questions when the hypothesis on w is relaxed to w tangent to OO at 
t=0. 

23. Extend results on the conservation of helicity to other 3-manifolds (, via a computa- 
tion of 


(1.47) (A: + Lu) (GA W). 


24. If we consider the motion of an incompressible fluid of variable density p(t, x), the 
Euler equations are modified to 


(1.48) p(ue+ Vuu) =—gradp, pet Vup =0, 


and, as before, div u = 0, wu tangent to OQ. Show that, in this case, the vorticity 
w = du satisfies 


(1.49) Ow + Lut = p 7dp A dp. 


(Results in subsequent sections will not apply to this case.) 


2. Existence of solutions to the Euler equations 


In this section we will examine the existence of solutions to the initial value prob- 
lem for the Euler equation: 


(2.1) —+PV,u=0, u(0)=u, 


given div uo = 0, where P is the orthogonal projection of L?(M,TM) onto 
the space V, of divergence-free vector fields. We suppose / is compact without 
boundary; regions with boundary will be treated in the next section. 

We take an approach very similar to that used for symmetric hyperbolic equa- 
tions in § 2 of Chap. 16. Thus, with J- a Friedrichs mollifier such as used there, 
we consider the approximating equations 
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Ouse 
Ot 


(2.2) + PJeVu.Jette =0, ue (0) = uo. 


As in that case, we want to show that u- exists on an interval independent of ¢, 
and we want to obtain uniform estimates that allow us to pass to the limit e > 0. 
We begin by estimating the L?-norm. Noting that u-(t) = Pu.(t), we have 


d 
(2.3) dt I|ue(t)||Z2 = —2(PJeVu_ Jete, Ue) 
= —2(Vu. Sete, Jette). 

Now, generally, we have 
(2.4) View = —-V yw — (div v)w, 
as shown in § 3 of Chap. 2. Consequently, when div v = 0, we have 
(2.5) (Vy»w, w) = —(Vyw,w) = 0. 
Thus (2.3) yields (d/dt)||u-(t)||7.2 = 0, or 
(2.6) ue (€)Ilz2 = luoll 22. 
It follows that (2.2) is solvable for all t € R, when e > 0. 

The next step, to estimate higher-order derivatives of u-, is accomplished in 
almost exact parallel with the analysis (1.8) of Chap. 16, for symmetric hyperbolic 
systems. Again, to make things simple, let us suppose IZ = T”; modifications for 


the more general case will be sketched below. Then P and J- can be taken to be 
convolution operators, so P, J-, and D® all commute. Then 


d a a Le 
(2.7) q WPeue(Mlliz = —2(D° PIeVu, Jette, D*ue) 


= —2(D°L-Jeue, D° Jeue), 


where we have set 


(2.8) Lew = L(u,, D)u, 
with 
(2.9) L(v, D)w = Vow, 


a first-order differential operator on w whose coefficients L;(v) depend linearly 
on v. By (2.4), 
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(2.10) L-+ Li =0, 


since div uz = 0, so (2.7) yields 
d a 2 a a 
(2.11) 77 ||D°ue(t)||Z2 = 2([Le, D°|Jeue, D° Jee). 


Now, just as in (1.13) of Chap. 16, the Moser estimates from § 3 of Chap. 13 yield 


I[Le, Dull 2 
212) SCV (lj (ue)ll allel + IVLj(ue)|l2- ljhs). 
J 


Keep in mind that L;(u-) is linear in u-. Applying this with w = J-u-, and 
summing over |a| < k, we have the basic estimate 


d 
(2.13) a Mellie S Clue llc llue()llire: 


parallel to the estimate (1.15) in Chap. 16, but with a more precise dependence 
on ||uwe(t)||c1, which will be useful later on. From here, the elementary argu- 
ments used to prove Theorem 1.2 in Chap. 16 extend without change to yield the 
following: 


Theorem 2.1. Given up € H*(M), k > n/2 +1, with div ug = 0, there is a 
solution u to (2.1) on an interval I about 0, with 


(2.14) u € L@(I,H*(M)) 2 Lip(I, H®-1(M)). 


We can also establish the uniqueness, and treat the stability and rate of con- 
vergence of u- to u, just as was done in Chap. 16, § 1. Thus, with « € [0,1], we 
compare a solution u to (2.1) to a solution u_ to 


Ouse 


(2.15) 


+ PIeVu.Jee =0, te(0) = vo. 


Setting v = u—Ue, we can form an equation for v analogous to (1.25) in Chap. 16, 
and the analysis (1.25)—(1.36) there goes through without change, to give 


(2.16) |lv(#)|I2 < Ko(t) (luo — voll2 + Ka(t)IIT — JelI2are—1,22))- 
Thus we have 


Proposition 2.2. Given k > n/2 + 1, solutions to (2.1) satisfying (2.14) are 
unique. They are limits of solutions uz to (2.2), and for t € I, 
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(2.17) ||u(t) = ue(t)|| 2 < Ky(t)||L = Je\| cuHe-1,12)- 


Continuing to follow Chap. 16, we can next look at 


d 
(2.18) qlDeyeu()llz2 = —2(D* JePV wu, D* Jeu) 


= —2(D° J-L(u, D)u, D* Jeu), 


given the commutativity of P with D°J,, and then we can follow the analysis of 
(1.40)-(1.45) given there without any change, to get 


d 
(2.19) Gq eu Ollie < C(L+ elle) lle ie: 


for the solution wu to (2.1) constructed above. Now, as in the proof of Proposition 
5.1 in Chap. 16, we can note that (2.19) is equivalent to an integral inequality, and 
pass to the limit ¢ — 0, to deduce 


d 
(2.20) q Ollie S C(L+ (ello) lu lie: 


parallel to (1.46) of Chap. 16, but with a significantly more precise dependence 
on ||u(t)||c1. Consequently, as in Proposition 1.4 of Chap. 16, we can sharpen the 
first part of (2.14) to 

u € C(I, H*(M)). 


Furthermore, we can deduce that if wu € C(I, H"(M)) solves the Euler equation, 
I = (—a,}), then u continues beyond the endpoints unless ||u(t)||c1 blows up 
at an endpoint. However, for the Euler equations, there is the following important 
sharpening, due to Beale-—Kato—Majda [BKM]: 


Proposition 2.3. [fu € C(I, H*(M)) solves the Euler equations, k > n/2 +1, 
and if 


(2.21) sup ||w(t)||L~ < K < oo, 
tel 


where w is the vorticity, then the solution u continues to an interval I', containing 
T in its interior, u € C(I’, H*(M)). 


For the proof, recall that if u(t) and w(t) are the 1-form and 2-form on M, 
associated to u and w, then 


(2.22) w = dt, 6u = 0. 
Hence dw = ddt + dot = Ati, where A is the Hodge Laplacian, so 


(2.23) i = Gow + Pou, 
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where Po is a projection onto the space of harmonic 1-forms on M, which is a 
finite-dimensional space of C'°°-forms. Now Go is a pseudodifferential operator 
of order —1: 


(2.24) Gi = A€OPS"'(M). 


Consequently, ||Gow]|| q1.2 < C>||w||ze for any p € (1,00). This breaks down 
for p = oo, but, as we show below, we have, for any s > n/2, 


(2.25) || Ad||cr < C(1 + log* ||| zs) 


||| potd. 
Therefore, under the hypothesis (2.21), we obtain an estimate 
(2.26) lu) llor < C(1 + log* |lullzx), 


provided k > n/2 + 1, using (2.23) and the facts that ||| 72-1 < c||ul| qx and 
that ||u(t)||,2 is constant. Thus (2.20) yields the differential inequality 


d 
(2.27) ap LC t lost yy, y(t) = llu(O)|lire- 


Now one form of Gronwall’s inequality (cf. Chap. 1, (6.19)-(6.21)) states that if 
Y(t) solves 


dY 


(2.28) ae F(t,Y), Y(0) = y(0), 


while dy/dt < F(t, y), andif OF /Oy > 0, then y(t) < Y(t) fort > 0. We apply 
this to F(t, Y) = C(1 + logt Y)Y, so (2.28) gives 


Y 
(2.29) / a = Ct+C\. 
(1+ log" Y)Y 
Since 
(2.30) / ae, =o, 
1 (1+logtY)Y 


we see that Y(t) exists for all t € [0,00) in this case. This provides an upper 
bound 


(2.31) llu(t) ie < YC), 


as long as (2.21) holds. Thus Proposition 2.3 will be proved once we establish the 
estimate (2.25). We will establish a general result, which contains (2.25). 
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Lemma 2.4. If P € OPS) ,,8 > n/2, then 


U s 
(2.32) || Pullze < Cllullz - [t+ 10g (7). 


We suppose the norms are arranged to satisfy |u| < ||u||~z». Another way 
to write the result is in the form 


1 
(2.33) | Pullzs < Ce*|lulline + C(log =) |lulla~., 


for 0 < ¢ < 1, with C independent of ¢. Then, letting e° = ||u!||_,~ /||ul| z= yields 
(2.32). The estimate (2.33) is valid when s > n/2+ 6. We will derive (2.33) from 
an estimate relating the L°-, H*’-, and C°-norms. The Zygmund spaces C’ are 
defined in § 8 of Chap. 13. 

It suffices to prove (2.33) with P replaced by P + cl, where c is greater than 
the L?-operator norm of P; hence we can assume P € OPS? 4 is elliptic and 
invertible, with inverse Q € OPS? o. Then (2.33) is equivalent to 


1 
(2.34) ull 1 < Ce*full ae + C(log =) Qua. 


Now since Q : C? — C®, with inverse P, and the C°-norm is weaker than the 
L°°-norm, this estimate is a consequence of 


1 
(2.35) lulla < Ce*|lullize + C(log =) llulloe, 


for s > n/2+6. This result is proved in Chap. 13, § 8; see Proposition 8.11 there. 
We now have (2.25), so the proof of Proposition 2.3 is complete. One conse- 
quence of Proposition 2.3 is the following classical result. 


Proposition 2.5. If dim M = 2, ug € H*(M), k > 2, and div uo = 0, then the 
solution to the Euler equation (2.1) exists for allt € R; u € C(R, H*(M)). 


Proof. Recall that in this case w is a scalar field and the vorticity equation is 


Ow 
(2.36) a t Vuw = 0, 


which implies that, as long as u € C(I, H*(M)), t € J, 
(2.37) Ilw()|[z-° = |]2(0)|Iz- 


Thus the hypothesis (2.21) is fulfilled. 


When dim M > 3, the vorticity equation takes a more complicated form, 
which does not lead to (2.37). It remains a major outstanding problem to decide 


570 17. Euler and Navier-Stokes Equations for Incompressible Fluids 


whether smooth solutions to the Euler equation (2.1) persist in this case. There 
are numerical studies of three-dimensional Euler flows, with particular attention 
to the evolution of the vorticity, such as [BM]. 

Having discussed details in the case M = T”, we now describe modifications 
when M is a more general compact Riemannian manifold without boundary. One 
modification is to estimate, instead of (2.7), 


d 
(2.38) G Ante lle = —2A A PIV u. Jette, Aue) 


= —2(A® PL. Jee, A’ Jette), 


the latter identity holding provided A, P, and J- all commute. This can be 
arranged by taking J = e°4; P and A automatically commute here. In this 
case, with D® replaced by A®, (2.11)-(2.12) go through, to yield the basic esti- 
mate (2.13), provided k = 2¢ > n/2+1. When [n/2] is even, this gives again the 
results of Theorem 2.1—Proposition 2.5. When [n/2] is odd, the results obtained 
this way are slightly weaker, if @ is restricted to be an integer. 

An alternative approach, which fully recovers Theorem 2.1—Proposition 2.5, 
is the following. Let {.X,} be a finite collection of vector fields on M, spanning 
Ty M at each x, and for J = (j1, ..., jx), let X7 = Vx;, °° Vx,,> 4 differential 
operator of order k = |.J|. We estimate 


d 
(2.39) rr |X7 a2 (t)||F2 = —2(X" PULL), X7u,). 


We can still arrange that P and J. commute, and write this as 


OLX J is, x? J) — 2 Glo Ne) 


(2.40) : ; : P 
OO Le, 1X PT) SO Pedi Be. 


Of these four terms, the first is analyzed as before, due to (2.10). For the second 
term we have the same type of Moser estimate as in (2.12). The new terms to 
analyze are the last two terms in (2.40). In both cases the key is to see that, for 
e € (0, 1], 


(2.41) [X7, PJ.] is bounded in OPS}5'(M)_ if |J| =k, 

which follows from the containment P € OPS? .(M ) and the boundedness of 
J- in OP'S) ,(M). If we push one factor Xj, in X/ from the left side to the right 
side of the third inner product in (2.40), we dominate each of the last two terms by 


(2.42) C||LeJetel| zx-1 - ||Uell ze 


if |.J| = k. To complete the estimate, we use the identity 
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(2.43) div(u @ v) = (div v)u+ Vou, 
which yields 
(2.44) LeJetle = div(Jete @ te). 


Now, by the Moser estimates, we have 
(2.45) |LeJetel| e-1 < Cll Jets ® tell ze < Clluellz~ ||Uell ze- 
Consequently, we again obtain the estimate (2.13), and hence the proofs of 
Theorem 2.1—Proposition 2.5 again go through. 

So far in this section we have discussed strong solutions to the Euler equations, 
for which there is a uniqueness result known. We now give a result of [DM], on the 
existence of weak solutions to the two-dimensional Euler equations, with initial 


data less regular than in Proposition 2.5. 


Proposition 2.6. If dim M = 2 and ug € H'?(M), for some p > 1, then there 
exists a weak solution to (2.1): 


(2.46) ué€ L® (Rt, H'?(M)) 1 C(Rt, L?(M)). 


Proof. Take f; € C°(M), f; + uo in H'?(M), and let v; € C°(Rt x M) 
solve 


Die 
(2.47) a +P div(v; ®vj;)=0, divv; =0, 0;(0) = fy. 

Here we have used (2.43) to write V,,v; = div(v; ® v;). Let w; = rot v;, so 
w,;(0) > rot uo in L?(M). Hence ||w;(0)||z» is bounded in 7, and the vorticity 
equation implies 

(2.48) lw; (O\lz2 << C, Vt,7. 

Also ||v;(0)||,2 is bounded and hence ||v,(t)||,2 is bounded, so 

(2.49) |v; || SC. 


The Sobolev imbedding theorem gives H!:?(M) c L?+?°(M), 5 > 0, when dim 
M = 2,80 


(2.50) ||v;(t) ® v;(t)|| p45 < C. 
Hence, by (2.47), 


(2.51) |Opv, (£)|| zp-1.248 =< C. 
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An interpolation of (2.49) and (2.51) gives 

(2.52) vj; bounded in C7 ([0,00), L°(M)), 

for some r > 0, s > 2. Together with (2.49), this implies 

(2.53) \|v;|| compact in C'([0, 7], L*(M)), 

for any T’ < oo. Thus we can choose a subsequence v;, such that 
(2.54) vj, —>u in C([0,T],L7(M)), VI <oo, 


the convergence being in norm. Hence 


(2.55) vj, @ vj, —u@u in C(R*, L*(M)), 
so 
(2.56) P div(v;, @ v;,) —+ Pdiv(u®@u) in C(R*,D'(M)), 


so the limit satisfies (2.1). 


The question of the uniqueness of a weak solution obtained in Proposition 2.6 
is open. 

It is of interest to consider the case when rot up = Wo is not in L?(M) for 
some p > 1, but just in L1(M), or more generally, let wo be a finite measure 
on M. This problem was addressed in [DM], which produced a “measure-valued 
solution” (i.e., a “fuzzy solution,” in the terminology used in Chap. 13, § 11). In 
[Del] it was shown that if wo is a positive measure (and MM = R?), then there 
is a global weak solution; see also [Mj5]. Other work, with particular attention 
to cases where rot ug is a linear combination of delta functions, is discussed in 
[MP]; see also [Cho]. 

We also mention the extension of Proposition 2.6 in [Cha], to the case wo € 
L(log L). 

The following provides extra information on the limiting case p = oo of Propo- 
sition 2.6: 


Proposition 2.7. If dim M = 2, rot ug € L%°(M), and u is a weak solution to 
(2.1) given by Proposition 2.6, then 


(2.57) ue C(Rt x M), 


and, for each t € R*™, in any local coordinate chart on M, if |x — y| < 1/2, 


1 
(2.58) lu(t,2) — u(t, y)| < Cla — y| log na ||rot uo||_t. 
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Furthermore, u generates a flow, consisting of homeomorphisms F' : M > M. 


Proof. The continuity in (2.57) holds whenever uo € H'?(M) with p > 2, as 
can be deduced from (2.46), its corollary 


(2.59) Ou € L©(Rt,L°(M)), p> 2, 
and interpolation. In fact, this gives a Holder estimate on u. Next, we have 
(2.60) ||rot u(£)|| zr < |jrot ug||z~, Vt> 0. 


Since u(t) is obtained from rot u(t) via (2.23), the estimate (2.58) is a conse- 
quence of the fact that 


(2.61) A€ OPS~!(M) = A: L®(M) = LLip(M), 
where, with d(x, y) = dist(x, y), A(6) = 6 log(1/6), 
(2.62) LLip(M) = {f € C(M) : |f(x) — f(y)| < CA((#,y)) }- 


The result (2.61) can be established directly from integral kernel estimates. Alter- 
natively, (2.61) follows from the inclusion 


(2.63) Ci(M) Cc LLip(M), 
since we know that A € OPS~!(M) > A: L©(M) > C}(M). In turn, the 


inclusion (2.63) is a consequence of the following characterization of LLip, due 
to [BaC]: 


Let Uy € O5°(R”) satisfy Vo(€) = 1 for |€| < 1, and set U,(€) = Vo(2-*E). 
Recall that, with wo = Vo, wy, = VU, — Ve_1 fork > 1, 


f € CL(R") => |lve(D) fllb~ < C. 
It follows that, for any u € E’(R”), 
(2.64) u € CL(R") => ||Vuz(D)ullt~ < C. 


By comparison, we have the following: 
Lemma 2.8. Given u € E'(R"), we have 
(2.65) u € LLip(R”) => ||[VU;(D)ullp~ < C(k +1). 
We leave the details of either of these approaches to (2.61) as an exercise. Now, 


for t-dependent vector fields satisfying (2.57)—(2.58), the existence and unique- 
ness of solutions of the associated ODEs, and continuous dependence on initial 
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data, are established in Appendix A of Chap. 1, and the rest of Proposition 2.7 
follows. 

We mention that uniqueness has been established for solutions to (2.1) 
described by Proposition 2.7; see [Kt1] and [Yud]. A special case of Proposition 
2.7 is that for which rot ug is piecewise constant. One says these are “vortex 
patches.” There has been considerable interest in properties of the evolution of 
such vortex patches; see [Che3] and also [BeC]. 


Exercises 
1. Refine the estimate (2.13) to 


d 


(2.66) 5 llue 


2 2 
(t)llie S Cl|Vuellz~ luc) lla: 
fork >n/2+1. 
2. Using interpolation inequalities, show that ifk = s+r, s=n/2+1-+ 6, then 
d 


Gq Muelle S Clluc(é)l| 


2(1+7) = 3), 
Hk ’ a ok’ 


3. Give a treatment of the Euler equation with an external force term: 


ot Yuu = gradp+f, divu=0. 


4. The enstrophy of an smooth Euler flow is defined by 


(2.67) 


(2.68) Ens(t) = Ilw(t)IIZ2 cary: w = vorticity. 
If uw is a smooth solution to (2.1) on J x M, t € I, and dim M = 3, show that 


d 


(2.69) <li 


(t)||22 = 2(Vwu, w) 


2 
5. Recall the deformation tensor associated to a vector field wu, 
(2.70) Def(u) = (VutVu'), 

which measures the degree to which the flow of wu distorts the metric tensor g. 


Denote by ,, the associated second-order, symmetric covariant tensor field (i.e., 
Ou = (1/2)Lug). Show that when dim M = 3, (2.69) is equivalent to 


(2.71) = |\w(t)|32 = 2 f d(w,w) dV. 


6. Show that the estimate (2.32) can be generalized and sharpened to 


7) [[Pullu~ <Cllulley [1+ 10e(E*")|, Pe opst., 


given 6 € [0,1), p € (1,00), and s > n/p. 
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7. Prove Lemma 2.8, and hence deduce (2.61). 


3. Euler flows on bounded regions 


Having discussed the existence of solutions to the Euler equations for flows on a 
compact manifold without boundary in § 2, we now consider the case of a compact 
manifold M with boundary OM (and interior 1/7). We want to solve the PDE 


a 
(3.1) OT + PV,u=0, divu=0, 


with boundary condition 

(3.2) y-u=0 ondM, 

where v is the normal to 0M, and initial condition 

(3.3) u(0) = uo. 

We work on the spaces 

(3.4) V* = {ue H*(M,TM) : divu=0, V+ Ul ya, = OF- 


As shown in the third problem set in § 9 of Chap. 5 (see (9.79), V° is the clo- 
sure of V, (given by (1.6)) in L?(M, TM). Hence the Leray projection P is the 
orthogonal projection of L?(M, 7M) onto V°. This result uses the Hodge decom- 
position, and results on the Hodge Laplacian with absolute boundary conditions, 
which also imply that 


(3.5) P: H*(M,TM) —V*. 
Furthermore, the Hodge decomposition yields the characterization 
(3.6) (I — P)v = —grad p, 

where p is uniquely defined up to an additive constant by 


) 
(3.7) —Ap = divvon M, _ =v-vondM. 
Vv 
See also Exercises 1—2 at the end of this section. 
The following estimates will play a central role in our analysis of the Euler 
equations. 
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Proposition 3.1. Let u and v be C'-vector fields in M. Assume u € V*. Ifv € 
H**1(M), then 


G8) |(Vuv,v)a| < C(lullor|lullire + lhellerllelle )lolire, 

while if v € V*, then 

3.9) | — P)Vurlf je < C{lllcs lone + [lull erlollcr) 

Proof. We begin with the k = 0 case of (3.8). Indeed, Green’s formula gives 

(3.10) (Vyv, w)r2 = —(v, Vaw) 22 — (v, (div u)w) 22 + fo u) (v, w) dS. 
aM 


If div u = O andr - ti aes = 0, the last two terms vanish, so the k = 0 case of 
(3.8) is sharpened to 


(3.11) (Vuv,v)p2=0 ifueV? 
and v is C1! on M. This also holds if u€ V9 C(M,T) and v € H?. 
To treat (3.8) for k > 1, we use the following inner product on H*(M,T). 


Pick a finite set of smooth vector fields {X;}, spanning TM for each x € M, 
and set 


(3.12) (u,v) pe = > (X7%u,X7v)z2, 
[J1<k 


where X7 = Vx;, °° V x;, are as in (2.39), | J| = £. Now, we have 
(3.13)  (X7Vy,v, X7v)p2 = (VyX7u, X70) p2 + ([X7, Valu, X7v) pe. 


The first term on the right vanishes, by (3.11). As for the second, as in (2.12) we 
have the Moser estimate 


(B14) |IIX7, Valoll ye < Chelle: lull» + lellarllolles). 
This proves (3.8). 
In order to establish (3.9), it is useful to calculate div Vv. In index notation 


X = Vuyv is given by XI = v) ,u*, so div X = X/,; yields 


(3.15) div Viv = 0) 4,ju" + v7 guj. 
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If M is flat, we can simply change the order of derivatives of v; more generally, 
using the Riemann curvature tensor R, 


(3.16) V siej = Vick + Ri gjQ0°. 
Noting that Ri jk = Ricex is the Ricci tensor, we have 
(3.17) div V,,v = V(div v) + Ric(u, v) + Tr((Vu)(Vv)), 


where Vu and Vv are regarded as tensor fields of type (1, 1). When div v = 0, of 
course the first term on the right side of (3.17) disappears, so 


(3.18) div v =0 => div Vv = Tr((Vu)(Vv)) + Ric(u, v). 


Note that only first-order derivatives of v appear on the right. Thus P acts on V,v 
more like the identity than it might at first appear. 
To proceed further, we use (3.6) to write 


(3.19) (1— P)V,,v = —grad y, 
where, parallel to (3.7), ~ satisfies 


Oy 


(3.20) —Ag= divV,vonM, — 
Ov 


V- (Vuv) on OM. 


The computation of div V,,,v follows from (3.18). To analyze the boundary value 
in (3.20), we use the identity (v, Vv) = Vu(v, v) — (Wu, v), and note that when 
u and v are tangent to 0M, the first term on the right vanishes. Hence, 


(3.21) (v, Vv) = —(Vyv,v) = II (u,v), 
where IT is the second fundamental form of 0M. Thus (3.20) can be rewritten as 


(3.22)  —Ay =Tr((Vu)(Vv)) + Ric(u,v) on M, oe = —II(u,v). 


V 


Note that in the last expression for Oy/Ov there are no derivatives of v. Now, by 
(3.22) and the estimates for the Neumann problem derived in Chap. 5, we have 


3.23) Vella» <C(lullcsleollare + lrellzllellcr), 


which proves (3.9). 


Note that (3.8)—(3.9) yield the estimate 
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(3.24) |(PVuv,v) ax | < C(llullc» olla + lullzellvllcr ) Ilvllaze, 


givenu € V*¥,v € VRF, 

In order to solve (3.1)-(3.3), we use a Galerkin-type method, following 
[Tem2]. Fix k > n/2+ 1, where n = dim M, and take up € V*. We use 
the inner product on V*, derived from (3.12). Now there is an isomorphism 
Bo: V¥ = (V*)’, defined by (Bov, w) = (v, w) yx. Using V*¥ CV c (V*)!, 
we define an unbounded, self-adjoint operator B on V° by 


(3.25) D(B) ={veV": Bove V}, B=Bo ety 
This is a special case of the Friedrichs extension method, discussed in general in 
Appendix A, § 8. It follows from the compactness of the inclusion V* <> V° that 
B~! is compact, so V® has an orthonormal basis {w; : j = 1,2,...} such that 
Bu; = A;w;, A; 77. Let P; be the orthogonal projection of V° onto the span 
of {wi,...,wy,}. It is useful to note that 


(3.26) (Pju,v)yo =(u,Pjv)yo and (Pju,v)ye = (u, Pyv)ye. 
Our approximating equation will be 


O : 
(3.27) rn + P)Vu,uj =0, uj (0) = Pjuo. 


Here, we extend P; to be the orthogonal projection of L*(M, TM) onto the span 
of {wi, sae , Wy}: 
We first estimate the V°-norm (i.e., the L?-norm) of u;, using 
d 2 
q Nes Ollvo = —2(PjVu; uj, uj) ve 


= —2(Vu,; Uy, Uj) L2- 


(3.28) 


By (3.11), (Vu, uj, Uj) 52 = 0, so 
(3.29) lu; (2)[lvo = ||Pjuoll ze. 
Hence solutions to (3.27) exist for all t € R, for each 7. 


Our next goal is to estimate higher-order derivatives of u;, so that we can pass 
to the limit 7 — oo. We have 


d 
(3.30) a Iu; (t) ||P. = —2(Pj Vu, Uj, Uy )ve = —2(PVu, Uj, us)ve, 


using (3.26). We can estimate this by (3.24), so we obtain the basic estimate: 
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d 
(3.31) Gq Wes live S Cllesllcsllusllve- 


This is parallel to (2.13), so what is by now a familiar argument yields our exis- 
tence result: 


Theorem 3.2. Given uo € V", k > n/2 +1, there is a solution to (3.1)-(3.3) 
fort in an interval I about 0, with 


(3.32) we LUV") n Lip(I,V*—). 
The solution is unique, in this class of functions. 


The last statement, about uniqueness, as well as results on stability and rate of 
convergence as 7 —> oo, follow as in Proposition 2.2. 

If u is a solution to (3.1)—(3.3) satisfying (3.32) with initial data up € V¥, we 
want to estimate the rate of change of ||u(t)||3,., as was done in (2.18)—(2.20). 
Things will be a little more complicated, due to the presence of a boundary OM. 
Following [KL], we define the smoothing operators J. on H*(M,T'M) as fol- 
lows. Assume M is an open subset (with closure /) of the compact Riemannian 
manifold M without boundary, and let 


E:H'(M,T) > H{(M,T), 0<¢<k41, 


be an extension operator, such as we constructed in Chap.4. Let R : H¢ 
(M,T) + H*(M,T) be the restriction operator, and set 


(3.33) Jeu = RJ-Eu, 


where ya is a Friedrichs mollifier on M. If we apply J- to the solution u(t) of 
current interest, we have 

d 2 

eu lize = —2(JePVuu, Jeu) ye 


= -2(JeVutt, Jet) xe + 2(Je(1 — P)Vuu, Jett) pre 


Using (3.9), we estimate the last term by 
2| (Jel P)V gti; Jett) | 


(3.35) 
<C||(L- P)Vuull ge lulls < Cllu(t)|lcr lu) Ile. 


To analyze the rest of the right side of (3.34), write 


(3.36) DeVuths Sette =, (IV tt Ta) pas 


|J|<k 
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using (3.12). Now we have 

(3.37) XI INV yt = X7 (Je, Valu + (X7, Val icu+ Val X7 Jeu). 
We look at these three terms successively. First, by (3.14), 

(3.38) [X7, VulJeul]p2 < Cllu()lleranllu(t) ae cu: 


Next, as in (1.44)-(1.45) of Chap. 16 on hyperbolic PDE, we claim to have an 
estimate 


(3.39) Le, Vale aca) S Cllallexanllulliecan. 


To obtain this, we can use a Friedrichs mollifier a on M with the property that 
(3.40) supp w C K => supp Jew C K, K=M\M. 

In that case, if vw = Hu and w = Ew, then 

(3.41) [Je, Vulw = R[Je, Val @. 

Thus (3.39) follows from known estimates for di, 


Finally, the L?(/)-inner product of the last term in (3.37) with X/ J_u is zero. 
Thus we have a bound 


(3.42) |(JeVuu, Jeu) ze | < Cllu(t)||cr||u®) |g, 
and hence 

d 2 2 
(3.43) qlee lle < Cllu(t)|lc|lu(t) | Fp. 


As before, we can convert this to an integral inequality and take « — 0, obtaining 


t 
(3.44) I|u(t) llF% < lluoll Fre + cf Ilu(s)I ca amyllu(s) [lire as. 
As with the exploitation of (2.19)-(2.20), we have 


Proposition 3.3. [fk > n/2+1, uo € V*, the solution u to (3.1)(3.3) given by 
Theorem 3.2 satisfies 


(3.45) u € O(L,V*). 


Furthermore, if I is an open interval on which (3.45) holds, u solving (3.1)-(3.3), 
and if 
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3.46) sup [lu llevan < K < oe, 
te 


then the solution u continues to an interval I', containing I in its interior, u € 
C(I',V*). 

We will now extend the result of [BKM], Proposition 2.3, to the Euler flow 
on a region with boundary. Our analysis follows [Fer] in outline, except that, as 
in §2, we make use of some of the Zygmund space analysis developed in § 8 of 
Chap. 13. 


Proposition 3.4. [fu € C(I,V*) solves the Euler equation, with k > n/2 + 
1, I = (—a,b), and if the vorticity w satisfies 


(3.47) sup ||w(t)||z~° < K < oo, 
tel 


then the solution u continues to an interval I', containing I in its interior, u € 


C(I',V*). 
To start the proof, we need a result parallel to (2.23), relating u to w. 


Lemma 3.5. If ti and w are the 1-form and 2-form on M, associated to u and w, 
then 


(3.48) ts = 6G4 0 + PAG, 

where GA is the Green operator for A, with absolute boundary conditions, and 
re the orthogonal projection onto the space of harmonic 1-forms with absolute 
boundary conditions. 

Proof. We know that 

(3.49) di=w, dt=0, t=O. 


In particular, 7 € H4(M, A+), defined by (9.11) of Chap. 5. Thus we can write 
the Hodge decomposition of w as 


(3.50) i = (d+ 6)G4(d+ 6)u+ PAG. 
See Exercise 2 in the first exercise set of § 9, Chap. 5. By (3.49), this gives (3.48). 


Now since G4 is the solution operator to a regular elliptic boundary problem, 
it follows from Theorem 8.9 (complemented by (8.54)-(8.55)) of Chap. 13 that 


(3.51) (| CN"), 
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where C?(/) is a Zygmund space, defined by (8.37)(8.41) of Chap. 13. Hence, 
from (3.48), we have 


(3.52) eller < Cl]WOllz~ + Cla) |I[z2- 


Of course, the last term is equal to C'||@(0)||,2. Thus, under the hypothesis (3.47), 
we have 


(3.53) lu(t\lcr < K’ <oo, tel. 


Now the estimate (8.53) of Chap. 13 gives 
(3.54) lu(t)ilcr < C[1 + log* ([lu(t) Le) 


for any k > n/2 4+ 1, parallel to (2.26). 
To prove Proposition 3.4, we can exploit (3.43) in the same way we did (2.19), 
to obtain, via (3.54), the estimate 


d 
(3.55) C(t toe yy, y(t) = llu(t)llir- 


A use of Gronwall’s inequality exactly as in (2.27)-(2.31) finishes the proof. 
As in § 2, one consequence of Proposition 3.4 is the classical global existence 
result when dim M = 2. 


Proposition 3.6. If dim M = 2 and ug € V*, k > 2, then the solution to the 
Euler equations (3.1)—(3.3) exists for allt € R; u € O(R, V*). 


Proof. As in (2.36), the vorticity w is a scalar field, satisfying 


Ow 
BL +Vuw = 0. 


Since u is tangent to OM, this again yields 


() Ilse = [Iew(0) Iz. 


Exercises 


1. Show that if u € L?(M,TM) and div u = 0, then v - Ul oar is well defined in 
H~'(0M). Hence (3.4) is well defined for k = 0. 

2. Show that the result (3.6)—(3.7) specifying (I — P)v follows from (1.44). 
(Hint: Take p = —dG“B.) 

3. Show that the result (3.5) that P : H*(M,TM) — V* follows from (1.44). 
Show that V" is dense in V“, for0 < 2 <k. 
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4. Fors € [0, 00), define V° by (3.4) with s = k, not necessarily an integer. Equivalently, 
V°=V°N H°(M,TM). 
Demonstrate the interpolation property 
V°.V"lo =v, OK<O<1. 


(Hint: Show that P : H°*(M,TM) — V°*, and make use of this fact.) 
5. Let wu be a 1-form on M. Show that d* du = v, where, in index notation, 


ee : k ak 
UZ = Uj;k — Uk; 


In analogy with (3.15)-(3.16), reorder the derivatives in the last term to deduce that 
d*du = V* Vu — dd*u + Ric(u), or equivalently, 


(3.56) (d*d + dd*)u = V*Vu + Ric(u), 


which is a special case of the Weitzenbock formula. Compare with (5.16) of Chap. 10. 

6. Construct a Friedrichs mollifier on M, a compact manifold without boundary, having 
the property (3.40). (Hint: In the model case R”, consider convolution by «~" y(a/e), 
where we require f y(x)dx = 1, and » € C§°(R”) is supported on |z — e1| < 
1/2, er = (1,0,...,0).) 


4. Euler equations on a rotating surface 


Let M = 00 be a rotationally invariant surface in R®, rotating about its axis of 
symmetry (which we take to be the x3-axis) with a constant angular velocity w = 
—()/2. We will assume M is diffeomorphic to the standard sphere S?, and has 
positive Gauss curvature everywhere. In this section we study 2D incompressible 
Euler flows on 7. 

The formulation follows the approach of Rossby, as described in [Ped], which 
yields the Euler equation 


(4.1) a +Vyu=Ox(x2)Ju—Vp, divu=0, 
where 
(4.2) x(x) = €3- (2), 


v (a) being the unit outward pointing normal to / at x. Here w is the flow velocity, 
and J : T;,M — T,,M is counterclockwise rotation by 90°. In case M = S?, we 
have v(x) = x3. Generally, under our hypotheses, x = y(a3). The term Qy Ju 
in (4.1) incorporates the effect of the Coriolis force. 

The treatment here follows [T3]. It was stimulated by the treatment of Euler 
flows on a rotating sphere in [CMa]. Further works on rotating spheres include 
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[CM], [CG] and [Nu]. Much of the literature that brings in the Coriolis force, 
including [Ped], is produced in the setting of rotating planar domains. 

We will rewrite (4.1) as an equation for the 1-form w associated to u, and 
use this formulation to provide a vorticity equation, (4.21), which again is a con- 
servation law. We establish a global solvability result for (4.1), with an estimate 
for ||u(t)||z» (s = 2k > 4) of double exponential type. We then produce some 
classes of stationary solutions to (4.1), including zonal flows, and examine the 
issue of stability of the stationary zonal flows, noting the role of Q in enhancing 
such stability. 

We proceed with the study of (4.1). Bringing in the 1-form w, arising from u 
as in §1, we can rewrite (4.1) as 


On 
(4.3) op t Vu = Ox * a dp, 5% = 0. 

We can eliminate p from (4.3) via the Leray projection P, defined as in §1, obtain- 
ing 


(4.4) as PV yt = QBu, 
Ot 

where 

(4.5) Bi = P(x * Pi). 


It is convenient to produce alternative formulas for B, using the formula 
(4.6) P=—6A5"d, 


where AZ ' denotes the inverse of the Hodge Laplacian on 2-forms, defined to 
annihilate the area form. This follows from the Hodge decomposition. We obtain 


~ _ _ga-l « Pai 
“ emere 
since d * Pu = 0. From (4.5) and (4.7), respectively, we obtain 
(4.8) B*=-B, BeOPs-'(M). 

Going further, since, for M diffeomorphic to S?, H! (MM) = 0, we have 
(4.9) 6 =0on M => ti = xdf, 


for a scalar function f (the stream function), uniquely determined up to an additive 
constant, it is useful to compute 
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(4.10) B(«df) = 6Az* (dy A df). 
With a denoting the area form on /, we have 


dx \ df = — « *dy A df = df A *(*dx) 
(4.11) = (df, xdx)a = (df, JIVx)a 
= +Zf, 


with the vector field 7 given by 


(4.12) Za 5V% 
Note that 
(4.13) div Z = 0. 


The formula (4.10) yields 


B(«df) = 6Az' * Zf 


4.14 
> = +*dAj'Zf. 


Note that (4.13) implies that Z is skew-adjoint and that tus Zf dS = 0. We see 
from (4.14) that 


(4.15) V NKer B = {«df : f ¢ H'(M), Zf =0}, 
where 
(4.16) V = {a € L?(M, A’) : 6% = OF. 


Vorticity equation 
We next derive a PDE for the vorticity w of a flow u, given by 


(4.17) dit = 


Ee 


=wa, w=rotu, 
where qa is the area form on M. For this, it is convenient to rewrite the Euler 
equation (4.3) as 


Ot . 5 1,5 A 
(4.18) Spt Lutt = Oy # + a(S Iu -p), ou = 0, 
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obtained as in (1.15), where £ is the Lie derivative, related to the covariant 
derivative as in (1.13)-(1.14). The advantage of (4.18) is that the Lie derivative 
commutes with the exterior derivative, so applying the exterior derivative to (4.18) 
yields 


a + Ly = Nd(x * i) 


ve = Udy A *it) 
= O(dx, u)a, 


hence (since £,,a = 0) we have the vorticity equation 


Ow 
— + V,w = X(dx, 
(4.20) a = Oey 
= OVuX. 
We can rewrite (4.20) as 
O 
(4.21) — (w — Ox) + Vu(w — Qyx) =0, 


ot 


which is a conservation law. 
It is useful to know that we can reverse the path from (4.18) to (4.19). 


Lemma 4.1. Assume 6u = 0 and set H = di. If W satisfies (4.19), then & satisfies 
(4.18). 


Proof. For such u, the Hodge decomposition on MV allows us to write 


(4.22) oe + Lyi Oxi = dF +6, 


where G is a 1-form on M (for each ¢) satisfying 
(4.23) 5G =0. 
Applying the exterior derivative to (4.22) yields 


Ow 


ay + Lut = UVux)a+ dG. 


(4.24) 
If (4.19) holds, we deduce that 
(4.25) dG =0, 


which, together with (4.23), implies G = 0, since H'(M) = 0. 
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We recall that the identity #1(M/) = 0 also leads to (4.9), or equivalently 
(4.26) divu=0=>u=JVy, 


with scalar f (the stream function) determined up to an additive constant, which 
we can specify uniquely by requiring 


(4.27) / fd =0. 
M 
Note that 
(4.28) w=Af, and i= +*dAq'w, 


with A~! defined on scalar functions to annihilate constants and have range sat- 
isfying (4.27). We can hence rewrite the vorticity equation (4.20) as 


(4.29) ot + (IVF, V(w —Oy)) = 0. 


Another conservation law 

Here we derive a conservation law that explicitly arises from the rotational 
invariance of (7. Say the vector field X3 generates rotation about the x3-axis (of 
period 27). Then X3, as a vector field on M, generates a flow by isometries on 
M. Hence we have div X3 = 0 on M, so there exists € € C™(M) such that 
(4.30) JIVE = —X3. 


Clearly X3€ = 0. We note that 


(4.31) M=S? => x=£=23. 


More generally, under our geometrical hypotheses on 7, we have 


(4.32) x and € are smooth functions of x3, 
with strictly positive 73-derivatives. 


See §3.5 of [T3]. We aim to establish the following conservation law. 


Proposition 4.2. [fu solves (4.1) and rot u = w, then 


(4.33) [mutt x) dS(ax) is independent of t. 
M 
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Proof. From the vorticity equation (4.20) we have 
d Ow 
< [ gwityas = [igs 
M M 
(4.34) = [evel —w) dS 
M 


= = [ (ww) dS + af evux dS. 
M M 


Now (4.32) implies € is a smooth function of y; write € = €(y). Then €Vux = 
VuG(x), where G’(x) = €(x). Hence 


(4.35) if evux dS = [¥eG00 dS = 0, 
M M 
since div u = 0 implies V,, is skew adjoint, and V,,1 = 0. Next, 
— f (9.0) dS = [vas dS 
M M 


= ; (IVF, Vé)(Af) dS 
(4.36) J 


= / (Xsf)(Af) aS 
M 


Now, since X3 commutes with A and is skew-adjoint, 
(4.37) (Xsf, Af) = —(Xs(-A)/7F, (-A)7f) = 0. 


It follows that 


d 
(4.38) ‘ / éw(t) dS =0, 
M 


proving Proposition 4.2. 


Existence of solutions to (4.1) 


Following §2, our approach to existence of solutions to (4.1), or equivalently 
(4.3), with initial data 
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(4.39) (0) = tio € H*(M), Stig =O, 


is to take a mollifier J; = y(eA,), with y € C§°(R) real valued, y(0) = 1, Ay 
the Laplace operator on 1-forms, and solve 

Otte 
(4.40) ot 


+ PJeVu. Jette = VS- Bethe, 

Pte =Ute, tie(0) = Jetio. 
Given € > 0, the short-time solvability of (4.40) is elementary, since this is essen- 
tially a finite system of ODEs. Our first goal is to obtain estimates of a(t) in 


H*(M), for t in some interval independent of ¢, and pass to the limit. 
To start, we have 


1d 


(4.41) ; qlle Ollie = —(PJeVy, Jetic, tie) + 0 JeBJette, tie). 


As in (2.3)-(2.5) of §2, the first term on the right is 0. By (4.8), so is the second 
term on the right. Hence 


(4.42) |te(4)||z2 = ||Jetol|z2- 


This is enough to guarantee global existence of solutions to (4.40), for each e > 0. 
To estimate higher-order Sobolev norms, we bring in 


(4.43) ell zrae = Palle, 
where A = A, is the Laplace operator on 1-forms. Then 


d,. - 
qlliteW) lhizes = (APP Ae 
LVAD tin Wa). 


2 
(4.44) y 


This is similar to (2.38), except we have an extra term in which 22 appears. Esti- 
mates parallel to those in (2.7)-(2.12), including Moser estimates, yield 


d,. ~ 7 x 
(4.45) F[l@e(t)Ilirx S Clit) ll Me) iran + ClO + lide) Ilize2x—, 
which is parallel to (2.13), except that k is replaced by 2h, and there is an extra 
term, involving a factor of |Q|. 
On the 2D manifold V/, 
(4.46) lldllor < Cyl) til| zr 


as long as s > 2, so (4.45) implies 
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(4.47) © Ite(#) 3 < Clltie() [fa + ClQ| - ie (®)|l Fra. 
By Gronwall’s inequality, we have, for t > 0, 

(4.48) lle) Ili S y(4), 

where y(t) solves 


dy 


(4.49) 7 


O(y*/? + |Qly), y(0) = [l-(0) [I 


In particular {a-(t) : 0 < t < Ty} is uniformly bounded, in H+(M/), indepen- 
dent of € € (0, 1], as long as 


(4.50) 720 [- dy 
"6 Jyoy) /? + |Qly 


From here we can argue as in §2 to obtain short time existence: 


Proposition 4.3. Given tio € H*(M), s = 2k > 4, dtio = 0, there is a unique 
solution to (4.3) on an interval I about 0, satisfying 


(4.51) ae C(I, H°(M))N CU, H*1(M)), a0) = tio. 


The solution depends continuously on the initial data to. Furthermore, if u is 
such a solution on I = (—a,b), then & continues beyond the endpoints unless 
\|@() ||o1(z) blows up at an endpoint. 


REMARK. Replacing Moser estimates by Kato-Ponce estimates, one can replace 
8 = 2k > 4in Proposition 4.3 by s > so, for any so > 2. This is done in [T3]. 


The solution w in Proposition 4.3 also satisfies the ¢ = 0 limit of (4.45), 


Oe : r P 
(4.52) = lle) ll S CllA(t) [Ic |]U(E) 3x6 + C1Q| - []@(t)[iz-—2, 


dt 


fors =2k > 4. 

We are now ready to establish global existence of solutions to (4.3), using the 
[BKM] argument as in §2, together with the vorticity equation, expressed as the 
conservation law (4.21), which implies that 


(4.53) I|w(t) — Qxllz = |}w(0) — Qxllz-, 
where w(t) = rot u(t). It follows that 


(4.54) Ilw(t) [b= < |]w(O)||n- + 2|QI, 
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since, by (4.2), ||x||z2 = 1. Now ||w(¢)||z20 does not bound ||z(¢)||o1, but, as in 
(2.25)-(2.26), we have 


(4.55) llallor < Cllw(t)||z~ (1 + log* lu(t)|liz-) + C, 
and hence (4.52) yields 

(4.56) S(t). < CsA(1 + log ||@(t)|F-) |G) lliz- 
with 

(4.57) A = ||w(0)||z- + C|Q| + 1. 


That is, with y(t) = ||a@(t)||7,., we have 


(4.58) — <C,A(1 + logt y(t))y(t), 


which via an argument involving Gronwall’s inequality leads to an estimate of the 
form 


(4.59) \|(t)\|z» < Cl|@(0)||%- exp exp (C.Alt|), 


with A given by (4.57). This implies that ||«%(t)|| 7s is bounded on [0, To) for all 
To < co, so we have: 


Proposition 4.4. In the setting of Proposition 4.3, there is a unique solution to 
(4.3) for all t € R, and it satisfies the global estimate (4.59). 
Stationary solutions to (4.1) 


A stationary solution to (4.1) is one for which Ou/Ot = 0. In such a case, 
w = rot u satisfies (4.20), with Ow/Ot = 0. Hence, by (4.29), 


(4.60) (JV, V(w — Ox)) = 0, 
where 
(4.61) w=Af, t=xdf, 


which defines f uniquely up to an additive constant. The equation (4.60) is equiv- 
alent to 


(4.62) V(Af — Qx) || VF. 
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By Lemma 4.1, whenever f satisfies (4.62), which implies (4.60), then wu, defined 
by (4.61), is a stationary solution to (4.3). 

In particular, we have the following class of stationary solutions. We say f € 
C™(M) is a zonal function if X3f = 0, where the vector field X3 generates a 
27-periodic rotation about the x3-axis. Then we say u = JV f is a zonal velocity 
field. 


Proposition 4.5. Assume M Cc R® is a smooth, compact surface, with positive 
Gauss curvature, and radially symmetric about the x3-axis. If f © C°°(M) isa 


zonal function, then u = JV f is a stationary solution to (4.1), for all Q. 


Proof. Under our hypotheses, we have y = y(a3), f = f(#3), and w = w(a3), 
so (4.62) holds. 


Corollary 4.6. With M as in Proposition 4.5, 

(4.63) each u € V 1 Ker Bis a stationary solution to (4.3). 

Proof. Recall that V M Ker B is given by (4.15), with Z = JV x, as in (4.12). 
The geometrical hypothesis on / implies 7 = ®X3, for some nowhere vanishing 
® € C™(M), which yields (4.63). 

While zonal functions are a prolific source of stationary solutions to (4.3), we 
note that there are stationary solutions that are not zonal. We give examples when 
M = S?, the standard sphere. To get started, note that (4.62) holds whenever 
there is a smooth ~ : R — R such that 
(4.64) Af =v(f)+Q23 (given M = S?). 

We will apply this with «(f) = —A, f, where A, is chosen from 
(4.65) Spec(—A) = {\y =k? +h: =0,1,2,3,...}. 


Note that x3 is an eigenfunction of —A with eigenvalue \, = 2. Thus we assume 
k > 2. Then (4.64) becomes 


(4.66) (A + rx) f = a3. 


As long as Ax, 4 2, (4.66) has solutions, and the general solution is of the form 


Q 
(4.67) f= \ 23923 + 9k; Gk © Ker(A +4 Ax). 
ee 


Thus gj, is the restriction to S? of a harmonic polynomial, homogeneous of degree 
k. For example, we can take 
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=2, A, =6, gx(x) = a} — x2, 


(4.68) a8 
k=3, A,p=12, gxe(x) = Re(x1 + ia2)”, 


and so on. For such f as in (4.67), we have 


Q 
(4.69) u=— X3 + IVG« 
eS 


as a Stationary solution to (4.1). 

Stationary solutions of the form (4.69) are known as Rossby-Haurwitz solu- 
tions of degree k. Such solutions, particularly in degree 2, arise in meteorology. 
They are observed on Earth, as well as the giant planets in our solar system. The 
paper [CG] analyzes the instability of such solutions, and discusses the impact 
of such instability on the difficulty of long-time weather forecasts. The paper 
[Nu] constructs further non-zonal stationary solutions on S? that are close to the 
Rossby-Haurwitz solutions. 


Stability of stationary solutions 


We next examine stability of stationary zonal solutions of (4.1), as usual assum- 
ing M is radially symmetric about the x3-axis and has positive Gauss curvature. 
We use the following variant of the Arn’old stability method (cf. [AK], pp. 89- 
94). Namely, we look for stable critical points of a functional 


1 
(4.70) H(u) = [ {ser + p(w — Qyx) + réw} dS, 
M 


with w = rot u and y and ¥ tuned to the specific stationary solution u. Recall that 
x and € are given by (4.2) and (4.30). Such a functional is independent of ¢ when 
applied to a solution u(t) to (4.1). Taking 


(4.71) u=JSVf, sow=Ayf, 
we rewrite (4.70) as 
1 2 
(4.72) H(A) = [ {5IVsP + e(Af- Ox) + EAs} a8. 
M 
A computation gives 


an(t+sa)= | {(0F.99) +s1Vo" 
(4.73) M 


+ y'(Af + sAg — Ox) Ag+ 7édg} dS, 
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sO 


dsH(f + s9)|,_ 


= [ov Vo) + ¢' (Af —Qx)Agt+ réAg} dS 
(4.74) A 


= [{-ar + Ay(Af -9x) + Ela as. 
M 


This vanishes for all g if and only if f — y’(Af — Qy) — y€ is constant. Since the 
stream function f is determined only up to an additive constant, we can write 


(4.75) f= (Af —Qx) + 7 


as the condition for f to be a critical point of H in (4.72). Note that (4.75) implies 
that, if 


(4.76) VFILVE, 

then 

(4.77) V(Af — Ox) | VF, 
hence 

(4.78) (JV, V(w — Ox)) = 0, 


so, by Proposition 4.5, such f produces a stationary solution to (4.1), provided f 
is a zonal function. If f is not a zonal function, one would need to take y = 0 in 
(4.70) in order for (4.78) to hold. Thus the Arn’old method apparently produces 
weaker stability results for non-zonal stationary solutions than for zonal stationary 
solutions. 

To proceed, we apply O, to (4.73) and evaluate at s = 0, to get 


4.79) H(F+59)|,_9= f {IVI +e" -20(A9)?} as. 
M 


Now, if we are given a zonal function f, we want to find y such that (4.75) 
holds, and then check (4.79) to see if this is a coercive quadratic form in g. Let us 
write also 


(4.80) Af=w(§), x=x(§). 


Then (4.75) takes the form 
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(4.81) F(E) = y'(w(E) — Ox(€)) + 4€, 
(4.82) gp’ (w(E) — Ox(€)) = F(E) — 4€- 


Given arbitrary 7 € R, this identity uniquely specifies vy’, provided 


(4.83) w(€) — Qy(€) is strictly monotone in €. 
that is, 
(4.84) w' (€) — Qx'(€) is bounded away from 0. 


With y’ determined, in turn y is determined, up to an additive constant, which 
would not affect the critical points of (4.72). Then, applying d/d€ to (4.82) yields 


(4.85) yp" (w —Qyx) = iG f < 5: 


Substitution into (4.79) gives 


(4.86) OP H(f +s9)|,_4 = [fiver +4 mg (0""} dS. 


M 


As long as the Gauss curvature of M is everywhere positive, both y and € are 
sooth, strictly monotone functions of 73, with positive 73-derivatives, so 


(4.87) x/(§) >a >O0on M. 


As long as the hypothesis (4.83)—(4.84) holds, then either 
(4.88) 


on M. In the first case, we can make 


(4.89) K() = mes 


on M by taking y > 0 large enough, and in the second case we can arrange (4.89) 
by taking ¥ sufficiently negative. Both cases yield 


>c>0 


(4.90) 0? H(f + sg)|,_9 = IlVall? + Cll Agliz2, 
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with C > 0, for all g € H?(M). This implies stability of f in H?(M) as a 
critical point of (4.72) (recall that f is defined only up to an additive constant), 


hence stability of win H1(M) as a critical point of (4.70). We summarize. 


Proposition 4.7. Given a smooth f(€), u = JV f is a stable stationary solution 
to (4.1), in H*(M), as long as Q is such that (4.83)-(4.84) hold, where w = Af. 


Note that w = Af implies 
(4.91) w(E) = (8) + FOIVE?, 
if f = (6). 
Linear stability 

We consider the linearization of the Euler equation (4.1) about a stationary 
solution u = JV f. More precisely, we work with the vorticity equation (4.29), 
for w = rotu = Af. We set 


(4.92) f-(th=ften+---, we(t)=wtec(t)t+---, ¢=Arn. 


Inserting these into the analogue of (4.29), using (4.29), and discarding higher 
powers of € produces the linearized equation 


(4.93) O6 + (IVA, VC) + (JV, V(w — Qx)) = 0. 


The second term on the left side of (4.93) is equal to —V jy(w—ay)7- Also, we 
can write 7 = A~!¢, where we define A~! to annihilate constants and have range 
orthogonal to constants. Then (4.93) becomes 


ag 


4.94 el 

(4.94) eae 

where 

(4.95) TC = —Vav 6 + Vavww-ay 7". 


The question of linear stability is the question of whether I’ generates a bounded 
group of operators on 


(4.96) L2(M) = {¢ © L?(M): [sas = 0}. 
M 


If we assume f is zonal, and recall our symmetry hypothesis on /, we have 
X3f = X3w =0,s0 f = f(€), w = w(€), so 
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(4.97) IVA =—f'(O)X3,  IV(w — Ox) = [Ox'(§) — w'(f)] Xs, 
and (4.95) becomes 
(4.98) TC = f'(E)Xa¢ + (Ox'(€) — w'(§)) Xa". 
In such a case, ’ commutes with X3, and we can decompose 


(4.99) L3(M) = QBVi, Vi = {6 € Le (M) : X3¢ = tk}, 
k 


obtaining 

(4.100) T= Tx, Ty, =ikMy, : Ve > Ve, 
k 

with M;, = Mv, 

(4.101) MC = f'(E)C + [2x'(E) — w'(E)JA~*S. 


Making use of this, [T3] established the following. 


Proposition 4.8. Assume T has the form (4.98). If SpecT is not contained in the 
imaginary axis, then some V;, has an eigenvalue with nonzero real part. 


This led to the following result, Proposition 4.2.3 of [T3]: 
Proposition 4.9. [fT has a eigenvalue with nonzero real part, then 
(4.102) w’(s) — Qy'(s) must change sign in s € (ao, 01), 


where (a, | is the range of €, and 


Vik ER, ds € (ao, a1) such that 


(4.103) 
(w"(s) — 2X'(s))(f"(s) — K) < 0. 

In the setting of planar flows (and with Q = 0), (4.102) is known as the 
Rayleigh criterion for linear instability, and (4.103) is called the Fjortoft crite- 
rion; see [MP], pp. 122-123. Proposition 4.9 is close to Proposition 4.7 in the 
following sense. By Proposition 4.7, if 


(4.104) w'(s) —Qy'(s) £0 forall s € [ao, ai], 


then the associated stationary zonal solution u = JV/f to (4.1) is stable, in 
Ht (M). Condition (4.102) is a little stronger than the assertion that (4.104) fails. 
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An appendix to [T3], “Matrix approach and numerical study of linear insta- 
bility,” by J. Marzuola and M. Taylor, investigates cases where the spectrum of I 
might be confined to the imaginary axis (i.e., the operators (/;, have real spectrum) 
even when (4.102) fails. Here, they specialize to M = S? and make calculations 
in cases 


(4.105) f(a) =cP.(x3), w(x) = —A cP, (a3), v = 2,3,4. 
The spaces V;, have orthogonal bases 
(4.106) {et PF (a3) : £ > |kl}, 


where fay are associated Legendre polynomials. Classical identities for spherical 
harmonics lead to representations of 4; as infinite matrices, whose truncations 
converge rapidly. Numerical work, using Matlab, indicates linear stability for Q 
somewhat less restricted than the Arnold-type stability analysis in Proposition 
4.7 requires. These numerical results suggest that there is more to discover about 
stability of zonal stationary solutions to (4.1). 


Other estimates on perturbations of stable stationary solutions 


If ws is a smooth zonal flow (hence stationary) that satisfies the stability con- 
dition of Proposition 4.7, and u®(t) is a solution to (4.1) with initial data close to 
us in H+-norm, then we have H(u*(t)) independent of t, and 


(4.107) |H(u®) —H(us)| © lu? — usllzn, 


when H(u) is given by (4.70), hence ||u*(¢) — wg|| 771 is bounded by C||u*(0) — 
us|| 71 for all t € IR. We now look for other bounds on 


(4.108) w(t) = u*(t) — Us, 


using methods of [T5], done there in the setting of planar (non-rotating) domains. 
Our analysis will use estimates of the form 


Allwllits 


4.109 <c(l 
(4.109) lwllor < C (log ete 


) [| rot wll z=, 


along lines similar to (4.55), derived from (8.49) of Chapter 13 (with & = 1), and 
also 


Apeeulieai2 
(4.110) lwllix < Clog Stet whe | rot wh n2, 


|| rot wl p2 
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similarly derived from (8.56) of Chaper 13, with n = q = 2,s = 1,p > 2, 
in both cases assuming divw = 0. Note that then || rot w||z2 * ||w]|q1, and 
the H'-norm on the 2D manifold M just fails to dominate the L°°-norm. Also 
|rot w||ze0 > ||rot w||z» & ||wl| qi, for 2 < p < oo. The estimate (4.109) 
holds whenever s > 2. In light of Proposition 4.4, we take s = 4. 

To apply (4.110) to w = w*(t), given by (4.108), we estimate || rot w* (¢)||z- 
crudely, using 


(4.111) || rot w* (t)||ze0 < || rot ué (#)|| ze + || rot ws||L~, 


plus the conservation law for vorticity, which gives 


(4.112) || rot ué (t) — Qy||te = || rot u* (0) — Ox||r~, 
hence 
(4.113) || rot u=(£)||L- < || rot u*(0)||z- + 2|Q]. 


Taking into account (4.107)-(4.108), we have 


AK(u®(0), ts) \ 1? 
oe as) w*(0)\lzn, 
K(u*(0),us) = || rot u*(0)||z- + | rot us|| b> + 2|Q|. 


“(t)\|n~ <C(1 
aes rw" Ollu~ < Clog 


In order to tackle the estimate on ||w*(#)||c1 via (4.109), we need an estimate 
on || rot w*(t)||z-0 different from what (4.111)-(4.113) gives. To get it, we take 
the vorticity equations for u*(0) and for wu, and form their difference, obtaining 


(4.115) o (rot w’) + Vue (rot w*) = —Vwe (rot us — 2x). 


This yields the estimate 


(4.116) || rot w* (t) || L-° < || rot w* (0) ||z- + Ko(u* (0), us) |], 
with 
(4.117) K2(u*(0), us) = || rot us — Qyx||cu sup ||w*(s)||r~, 


the last factor being 


AK (u®(0), us) 


7 oe 
TO) te Olen 


(4.118) < C (log 
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by (4.114). For large |t|, (4.116) is weaker than (4.111)-(4.113), so we interpret 
(4.116) as being effective on a time scale 


(4.119) |t] < Ko(u®(0),us)7*. 


For larger |t|, we should just use (4.111)-(4.113). 
With these results in hand, we then have from (4.109) that 


w* (Oller < C(log* Alle (Ola) | tot w* (Olin 


(4.120) 1 
C( log? ————___—_ t w* (t oo. 
+ C(log [rote] cx) tow Oz 
We have 
(4.121) log* Aljw®(t)||- < log* A(lu®(O)|l1+ + [lusllre), 


and the estimate (4.59) applies to ||u*(t)|| =, leading to an exponential estimate 
on (4.121). 

Observe the remarkable transition from a slow loss of stability measured via 
|| rot w*(t) ||, to an exponentially exploding loss of stability measured in the 
slightly stronger norm ||w*(t)||c1. Of course, taking into account the impact on 
the flows generated by u*, the latter norm is much more significant than the for- 
mer. To be sure, the estimates (4.120)-(4.121) are only upper bounds, and the 
issue of how sharp they are is an interesting problem. 


5. Navier-Stokes equations 


We study here the Navier-Stokes equations for the viscous incompressible flow 
of a fluid on a compact Riemannian manifold 1/7. The equations take the form 


3) 
(5.1) OT +V,u=vLu— gradp, divu=0, u(0) =uo. 
for the velocity field u, where p is the pressure, which is eliminated from (5.1) 
by applying P, the orthogonal projection of L?(M,TM) onto the kernel of the 
divergence operator. In (5.1), V is the covariant derivative. For divergence-free 
fields u, one has the identity 


(5.2) Vuu = div(u ®@ u), 


the right side being the divergence of a second-order tensor field. This is a special 
case of the general identity div(u ®@ v) = V,u + (div v)u, which arose in (2.43). 
The quantity v in (5.1) is a positive constant. If M = R”, CL is the Laplace 
operator A, acting on the separate components of the velocity field w. 
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Now, if / is not flat, there are at least two candidates for the role of the Laplace 
operator, the Hodge Laplacian 


A = —(d*d + dd’), 


or rather its conjugate upon identifying vector fields and 1-forms via the 
Riemannian metric (“lowering indices”), and the Bochner Laplacian 


La=-V'*V, 


where V : C°(M,TM) > C(M, T* ®T) arises from the covariant derivative. 
In order to see what £ is in (5.1), we record another form of (5.1), namely 


a 
(5.3) HT +V,u=vdivS— gradp, divu=0, 


where S is the “stress tensor” 
S=Vu4 Vu' = 2 Def u, 


also called the “deformation tensor.” This tensor was introduced in Chap. 2, § 3; 
cf. (3.35). In index notation, $7* = uJ** + u*J, and the vector field div S is given 
by . ; ; 
SIF = uF +4 uP, 
The first term on the right is —V*Vu. The second term can be written (as in 
(3.16)) as 
u*..') + R¥gJu! = (grad div u + Ric(u))’. 


Thus, as long as div u = 0, 
div S = —V*Vu + Ric(u). 


By comparison, a special case of the Weitzenbock formula, derivable in a similar 
fashion (see Exercise 5 in the previous section), is 


Au = —V*Vu — Ric(u) 
when wu is a 1-form. In other words, on ker div, 
(5.4) Lu = Au + 2 Ric(w). 


The Hodge Laplacian A has the property of commuting with the projection P 
onto ker div, as long as M has no boundary. For simplicity of exposition, we 
will restrict attention throughout the rest of this section to the case of Riemannian 
manifolds M for which Ric is a constant scalar multiple co of the identity, so 


(5.5) £L=N4+2c onker div, 
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and the right side also commutes with P. Then we can rewrite (5.1) as 


(5.6) a =vlLu—PV,,u, u(0) = up, 


where, as above, the vector field uo is assumed to have divergence zero. Let us 


note that, in any case, 
L = —2 Def* Def 


is a negative-semidefinite operator. 

We will perform an analysis similar to that of § 2; in this situation we will 
obtain estimates independent of v, and we will be in a position to pass to the limit 
v — 0. We begin with the approximating equation 


6) 
aaa PIV u. Jee =VIL Tee, Ue (0) = uo, 


(5.7) at 


parallel to (2.2), using a Friedrichs mollifier J-. Arguing as in (2.3)-(2.6), we 
obtain 


d 
(5.8) Ff ||ue(t)||Z2 = —4v||Def Jeue(t)||72 < 0, 
hence 
(5.9) I|ue(t)\|z2 < ||uollz2- 


Thus it follows that (5.7) is solvable for all t € IR whenever v > 0 ande > 0. 
We next estimate higher-order derivatives of u-, as in § 2. For example, if M = 

T”, following (2.7)-(2.13), we obtain now 

d 

q We Ollie S Clue llc lluc(t) fie — 4v|[Def Jeue(€)[lir« 


< Cllue(t)||c1|Iue(t)|lire, 


(5.10) 


for vy > 0. For more general MM, one has similar results parallel to analyses of 
(2.34) and (2.35). Note that the factor C' is independent of v. As in Theorem 2.1 
(see also Theorem 1.2 of Chap. 16), these estimates are sufficient to establish a 
local existence result, for a limit point of u- as € + 0, which we denote by uy. 


Theorem 5.1. Given up € H*(M), k > n/2 +1, with div ug = 0, there is a 
solution u, on an interval I = {0, A) to (5.6), satisfying 


(5.11) uy € L (I, H*(M)) On Lip(I, H*-?(M)). 


The interval I and the estimate of u, in L*® (I, H®(M)) can be taken independent 
ofv > 0. 
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We can also establish the uniqueness, and treat the stability and rate of conver- 


gence of uz to u = wu, as before. Thus, with « € [0,1], we compare a solution 
u = u, to (5.6) to a solution u,- = w to 


a 
(5.12) oa + PIV yJew =vILJew, w(0) = wo. 


Setting v = u, — Uy,-, we have again an estimate of the form (2.16), hence: 


Proposition 5.2. Given k > n/2 + 1, solutions to (5.6) satisfying (5.11) are 
unique. They are limits of solutions Uy< to (5.7), and, fort € I, 


(5.13) luv (t) — uve (t)Ilz2 < Ki (tL — Jellccae-1,12); 
the quantity on the right being independent of v € |0, 00). 


Continuing to follow § 2, we can next look at 


d 
(5.14) dt || D° Jeur(t)|l 72 i= 2( D* JL tin, D)up, De Ais 
— 2v||Def D* Jetty (t)|| 535 


parallel to (2.18), and as in (2.19)—(2.20) deduce 


d 
Gq Wu (#) lire < Cllr () lc» lle) lize — 4v||Def w(t) [lire 


< Cllur(t)lc+ lu ()|liz*- 


(5.15) 


This time, the argument leading to u € C(I, H*(M)), in the case of the solution 
to a hyperbolic equation or the Euler equation (2.1), gives for u, solving (5.6) 
with uo € H*(M), 


(5.16) u, is continuous in t with values in H*(M), att =0, 


provided k > n/2 +1. At other points t € J, one has right continuity in t. This 
argument does not give left continuity since the evolution equation (5.6) is not 
well posed backward in time. However, a much stronger result holds for positive 
t € I, as will be seen in (5.17) below. 

Having considered results with estimates independent of v > 0, we now look 
at results for fixed v > 0 (or which at least require v to be bounded away from 0). 
Then (5.6) behaves like a semilinear parabolic equation, and we will establish the 
following analogue of Proposition 1.3 of Chap. 15. We assume n > 2. 


Proposition 5.3. [f div uo = 0 and up € L?(M), with p > n = dim M, and if 
v > 0, then (5.6) has a unique short-time solution on an interval I = [0,T): 
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(5.17) u = up € C(I, L?(M))N C%((0,T) x M). 
Proof. It is useful to rewrite (5.6) as 


Ou 


(5.18) DE 


+ Pdiv(u@u)=vLu, u(0) = uo, 
using the identity (5.2). In this form, the parallel with (1.16) of Chap. 15, namely, 


Ou 
OE =vAut S > Oj Fj (u), 


is evident. The proof is done in the same way as the results on semilinear parabolic 
equations there. We write (5.18) as an integral equation 


t 
(5.19) u(t) = eu — | grep div(u(s) @ u(s)) ds = u(t), 
0 


and look for a fixed point of 

(5.20) W:CU,X) > CU,X), XxX = L?(M)n ker div. 

As in the proof of Propositions 1.1 and 1.3 in Chap. 15, we fix a > 0, set 
(5.21) Z = {ue C((0, TI, X) : u(0) = uo, ||u(t) — wollx < a}, 


and show that if T’ > 0 is small enough, then VW : Z — Z is a contraction map. 
For that, we need a Banach space Y such that 


(5.22) ® : X — Y is Lipschitz, uniformly on bounded sets, 
(5.23) eo TY = X, fort > 0, 


and, for some y < 1, 

(5.24) le“ IIea.x) < Ct77, fort € (0, 1]. 
The map @ in (5.22) is 

(5.25) ®(u) = P div(u ® wu). 


We set 
Y = H~1”/2(M) 1: ker div, 


and these conditions are all seen to hold, as long as p > n; to check (5.24), 
use (1.15) of Chap.15. Thus we have the solution uw, to (5.6), belonging to 
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C([0, 7], L?(M)). To obtain the smoothness stated in (5.17), the proof of smooth- 
ness in Proposition 1.3 of Chap. 15 applies essentially verbatim. 

Local existence with initial data ug € L"(M) was established in [Kt4]. We 
also mention results on local existence when ug belongs to certain Morrey spaces, 
given in [Fed, Kt5, T2]. 

Note that the length of the interval J on which wu, is produced in Proposition 5.3 
depends only on ||uo||z» (given M and v). Hence one can get global existence 
provided one can bound ||u(¢)||z»(ar), for some p > n. In view of this we have 
the following variant of Proposition 2.3 (with a much simpler proof): 


Proposition 5.4. Givenv > 0, p> n, ifu € C([0,T), L?(M)) solves (5.6), and 
if the vorticity w satisfies 


(5.26) sup |lw(t)ln <K<0, g=—”, 
teE[0,T) n+p 


then the solution u continues to an interval [0,T"), for some T’ > T, 

u € C((0,T’), L°?(M)) nC ((0, 7") x M), 
solving (5.6). 
Proof. As in the proof of Proposition 2.3, we have 

u= Aw + Pou, 

where Po is a projection onto a finite-dimensional space of smooth fields, A € 
OPS~'(M). Since we know that ||u(t)||z2 < ||wol|z2 and since A : L? > 
H'4 c L?, we have an L?-bound on u(t) as t 7 T, as needed to prove the 
proposition. 


Note that we require on q precisely that g > n/2, in order for the corresponding 
p to exceed n. 
Note also that when dim M = 2, the vorticity w is scalar and satisfies the PDE 


0 
(5.27) oa +V yw = (A + 2co)w; 
as long as (5.5) holds, generalizing the vy = 0 case, we have ||w(t)||z~ < 


e7¥0!|l1(0) || ,00 (this time by the maximum principle), and consequently global 
existence. 
When dim M = 3, w is a vector field and (as long as (5.5) holds) the vorticity 
equation is 
Ow 


(5.28) OE +V,w—-Vyu=vLw. 
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It remains an open problem whether (5.1) has global solutions in the space 
C'™((0,00) x M) when dim M > 3, despite the fact that one thinks this should 
be easier for v > O than in the case of the Euler equation. We describe here a 
couple of results that are known in the case v > 0. 


Proposition 5.5. Let k > n/2+1, v > 0. If ||uollz« is small enough, then (5.6) 
has a global solution in C'({0, 00), H®) NC™((0,00) x M). 


What “small enough” means will arise in the course of the proof, which will 
be a consequence of the first part of the estimate (5.15). To proceed from this, we 
can pick positive constants A and B such that 


||Def ul|zx > Allullze — Bllullz2, 
so (5.15) yields 


d 
a luli < {Cllu@) lox — vA} |lullire + 2vBllu(e)|l72- 


Now suppose 
llwollz2 $9 and |luollze < Ld; 


L will be specified below. We require Ld to be so small that 
A 
(5.29) llv[2. < 2L6 => |lullor < _ 


< 216, we 


Recall that ||u(t)|| 2 < ||uol|z2. Consequently, as long as ||u(t)||7,.. < 


have d 
ap VAY + 2yB5, y(t) = [lu(t)|lire- 


Such a differential inequality implies 
(5.30) y(t) < max{y(to),2BA~*d}, fort > to. 


Consequently, if we take L = 2B/A and pick 6 so small that (5.29) holds, we 
have a global bound ||u(¢)||7,. < L6, and corresponding global existence. 


A substantially sharper result of this nature is given in Exercises 4—9 at the end 
of this section. 

We next prove the famous Hopf theorem, on the existence of global weak solu- 
tions to (5.6), given v > 0, for initial data ug € L?(M). The proof is parallel 
to that of Proposition 1.7 in Chap. 15. In order to make the arguments given here 
resemble those for viscous flow on Euclidean space most closely, we will assume 
throughout the rest of this section that (5.5) holds with cp = 0 (i.e., that Ric = 0). 
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Theorem 5.6. Given uo € L?(M), div uo = 0, v > 0, the (5.6) has a weak 
solution for t € (0,00), 


ué L™(Rt, L?(M)) 1 LZ,(R*, H'(M)) 


31 
een AM Lipj. (Rt, H~?(M) + H7**(M)). 


We will produce u as a limit point of solutions u- to a slight modifica- 
tion of (5.7), namely we require each J- to be a projection; for example, take 
Je = x(€A), where (A) is the characteristic function of [—1, 1]. Then J- com- 
mutes with A and with P. We also require u-(0) = J-uo; then ue(t) = Jeue(t). 
Now from (5.9), which holds here also, we have 
(5.32) {ue : € € (0, 1]} is bounded in L*°(R™, L”). 


This follows from (5.8), further use of which yields 
T 
(5.33) w | ||Def ue(t)|Z2 dt = || Jeuoll72 — |lue(T)|l72, 
0 
as in (1.39) of Chap. 15. Hence, for each bounded interval J = [0, T], 
(5.34) {u-} is bounded in L?(I, H'(M)). 


Now, as in (5.18), we write our PDE for uz as 


Ouz 
ot 


(5.35) + PJ, div(uz ® uz) = vAuze, 


since Jz AJ-u- = Auz. From (5.32) we see that 


(5.36) {uc ® Ue : € € (0, 1]} is bounded in L* (Rt, L'(M)). 


We use the inclusion L!(M) Cc H~"/?—9(M). Hence, by (5.35), for each 5 > 0, 


(5.37) {9,u-} is bounded in L?(I, H~"/?-1~°(M)), 
SO 
(5.38) {uz} is bounded in H1(I, H~"/2-1~°(M)). 


As in the proof of Proposition 1.7 in Chap. 15, we now interpolate between 
(5.34) and (5.38), to obtain 


(5.39) {ue} is bounded in H*(I, H1~8—8("/2+149)()), 
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and hence, as in (1.45) there, 


(5.40) {u-} is compact in L?(I,H*~7(M)), 
for all y > 0. 
Now the rest of the argument is easy. We can pick a sequence ux, = Ue, 


(Ex — 0) such that 
(5.41) ux > u_ in L?((0,T], H'~7(M)), in norm, 


arranging that this hold for all T’ < oo, and from this it is easy to deduce that wu is 
a desired weak solution to (5.6). 

Solutions of (5.6) obtained as limits of we as in the proof of Theorem 5.6 are 
called Leray—Hopf solutions to the Navier-Stokes equations. The uniqueness and 
smothness of a Leray—Hopf solution so constructed remain open problems if dim 
M > 3. We next show that when dim / = 3, such a solution is smooth except 
for at most a fairly small exceptional set. 


Proposition 5.7. If dim M = 3 and u is a Leray—Hopf solution of (5.6), then 


there is an open dense subset J of (0,00) such that R* \ J has Lebesgue measure 
zero and 


(5.42) wel”( 72M), 


Proof. For T > 0 arbitrary, J = [0,7], use (5.40). With uz, = ue,, passing toa 
subsequence, we can suppose 


(5.43) \|Uroa —Uelle <2-*, E=L?(1,H*-7(M)). 


Now if we set 


(5.44) P(t) = sup [lun(@) lla. 

we have 

5.45) (8) < |jun(2)[lan—> + llemaa(t) — uel) nv, 
k=1 

hence 

(5.46) Te E7(7). 


In particular, ['(t) is finite almost everywhere. Let 


(5.47) S={tEI:T(t) < ow}. 
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For small y > 0, H!~7(M) c L?(M) with p close to 6 when dim M = 3, 
and products of two elements in H!~7(M) belong to H'/2-7 (M), with 7/ > 0 
small. Recalling that u_ satisfies (5.35), we now apply the analysis used in the 
proof of Proposition 5.3 to uz, concluding that, for each tp € S, there exists 
T(to) > 0, depending only on I(t), such that, for small y/ > 0, we have 


{ux} bounded in C([to, to + T(to)], H'~7(M)) NC™ ((to, to + T(to)) x M). 


Consequently, if we form the open set 


(5.48) Ir = J (to, to +T(to)), 


toES 


then any weak limit u of {u;} has the property that u € C(Jr x M). It remains 
only to show that I \ Jr has Lebesgue measure zero; the denseness of Zr in I 
will automatically follow. To see this, fix 6; > 0. Since meas(J \ S) = 0, there 
exists 62 > 0 such that if S5, = {t € S : T(t) > dg}, then meas(I \ S5,) < 64. 
But Jr contains the translate of S5, by 62/2, so meas(I \ Jr) < 61 + 62/2. This 
completes the proof. 


There are more precise results than this. As shown in [CKN], when M = R°, 
the subset of Rt x M on which a certain type of Leray—Hopf solution, called 
“admissible,” is not smooth, must have vanishing one-dimensional Hausdorff 
measure. In [CKN] it is shown that admissible Leray—Hopf solutions exist. 

We now discuss some results regarding the uniqueness of weak solutions to the 
Navier-Stokes equations (5.6). Thus, let J = [0, T], and suppose 
(5.49) vw, €L? (1.1 (M)) 0, A'M)), j= 1,2, 
are two weak solutions to 

Ou; : 
(5.50) aE + P div(u; ®u;) = vAu,, u;(0) = uo, 


where uo € L?(M), div uo = 0. Then v = uy — uz satisfies 


(5.51) 2 + P div(u; @vu+v@uz) =vAv, v(0) =0. 


We will estimate the rate of change of ||v(t)||7.., using the following: 


Lemma 5.8. Provided 
(5.52) v€L?(I,H'(M)) and _ € L°(I,H-1(M)), 


then ||v(t) 22 is absolutely continuous and 
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d 
at Ilw(t)|Z2 =2(u,,v)ra € yy. 


Furthermore, v € C(I, L?). 


Proof. The identity is clear for smooth v, and the rest follows by approximation. 


By hypothesis (5.49), the functions u,; satisfy the first part of (5.52). By (5.50), 
the second part of (5.52) is satisfied provided uj; ® uj € L?(I x M), that is, 
provided 


(5.53) uj € LA(I x M). 


We now proceed to investigate the L?-norm of v, solving (5.51). If u; satisfy 
both (5.49) and (5.53), we have 


d 
(5.54) Gi le Ollze = -2(Vour, v) — 2(Vuzv, 0) — 2u||Vollz2 


= 2(u1, Vov) — 2v||Vol|72, 


since V7, = —V, and V7, = —Vu, for these two divergence-free vector fields. 
Consequently, we have 


d 
(5.55) q Ollze S 2lleallzs - |lellca - [| Vellc2 — 2u||Vollze. 


Our goal is to get a differential inequality implying ||v(t)||;2 = 0; this requires 
estimating ||v(t)||;4 in terms of |v(t)||z2 and ||Vul|z2. Since H'/2(M?) c 
L4(M?) and H'(M*) c L°(M3), we can use the following estimates when 
dim M = 2 or 3: 


llollza < Clo Z2 «Volz + Cllollze, dim M = 2, 


(5.56) 
llollza < Clo Z" - Vol + Cllolze, dim M = 3. 


With these estimates, we are prepared to prove the following uniqueness result: 


Proposition 5.9. Let u; and uz be weak solutions to (5.6), satisfying (5.49) and 
(5.53). Suppose dim M = 2 or 3; if dim M = 3, suppose furthermore that 


(5.57) u, € LS(I,L4(M)). 
Tfu (0) = u2(0), then uy = ug onI x M. 
Proof. For v = wu; — ue, we have the estimate (5.55). Using (5.56), we have 


2|urllzallellza||Vullce < vl|Vollz2 + Cv? |lollz2 = [leallzs 


(5.58) 
+ Cu |ulli2- leans 
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when dim M = 2, and 


2|lual[zallullcallVollce < vil Vollz2 + Coll z2 > [lel 


(5.59) 
+ Cu |ullze- lleallzs 


when dim M = 3. Consequently, 


d 
(5.60) Slut) 2 < Cu) lo(@)IlEe (ella + lleallza), 


where p = 4 if dim M = 2 and p = 8 if dim M = 3. Then Gronwall’s inequality 
gives 


oC Es < lun(0) uot Op Is ex { Cute) f° (lla(s) tbe + leas) as}, 


proving the proposition. 


We compare the properties of the last proposition with properties that Leray— 
Hopf solutions can be shown to have: 


Proposition 5.10. [fu is a Leray—Hopf solution to (5.1) and I = [0,T], then 


(5.61) ucL(Ix M) ifdimM =2, 
and 

(5.62) ue L83(7,L4(M))_ ifdim M =3. 
Also, 

(5.63) ue L?(I,L*(M))  ifdimM =4. 


Proof. Since u € L© (I, L?)  L?(I, H'), (5.61) follows from the first part of 
(5.56), and (5.62) follows from the second part. Similarly, (5.63) follows from the 
inclusion 

H'(M*) c L4(M%). 


In particular, the hypotheses of Proposition 5.9 are seen to hold for Leray—Hopf 
solutions when dim M = 2, so there is a uniqueness result in that case. On the 
other hand, there is a gap between the conclusion (5.62) and the hypothesis (5.57) 
when dim M = 3. 


Exercises 


In the exercises below, assume for simplicity that Ric = 0, so (5.5) holds with co = 0. 
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1. One place dissipated energy can go is into heat. Suppose a “temperature” function 
T = T(t, 2) satisfies a PDE 


(5.64) sas + Vil = aT + 4v|Def u)’, 


coupled to (5.6), where a is a positive constant. Show that the total energy 


E(t) = [{ueor +T(t,2)} dx 


M 


is conserved, provided u and T possess sufficient smoothness. Discuss local existence 
of solutions to the coupled equations (5.1) and (5.64). 
2. Show that under the hypotheses of Theorem 5.1, 


Uy > Vv, asyv — 0, 


v being the solution to the Euler equation (i.e., the solution to the v = 0 case of (5.6)). 
In what topology can you demonstrate this convergence? 
. Give the details of the interpolation argument yielding (5.39). 
4. Combining Propositions 4.3 and 4.5, show that if div uo = 0, p > n, and ||wol| ze is 
small enough, then (5.6) has a global solution 


iss) 


u € O([0, 00), L?) NC ((0,00) x M). 


In Exercises 5-10, suppose dim M = 3. Let u solve (5.6), with vorticity w. 
5. Show that the vorticity satisfies 


(5.65) — ||w(t)||722 = 2(Vwu, w) — 2v||Vwl|22. 


6. Using (Viu, w) = —(u, Vww) — (u, (div w)w), deduce that 


d 
— |lw(t)|Z2 < Cllullzs - llwllze - ||Vwllz2 — 2v||Vwllz2- 
dt 
Show that 
(5.66) |w||z6 < Cl|Vwllz2 + Cllullz2, 
and hence 
d 2 2 2 2 
S llw()liz2 < Cllullzs (Volz + lulls) — 27 Vuze. 
7. Show that 


1/2 VAD 
lulls < Cllullt2 > wll + Cllullre, 


and hence, if ||wo||,2 = @, 


d 
q Meyilze S C(8"? lwllrs + B) (IVwllz2 + 6?) — 2v||Vullz2. 
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8. Show that there exist constants A, B € (0,00), depending on M, such that 
(5.67) |Vwllz2 > Alleollz2 - BS”, 


and hence that y(t) = ||w(t)||72 satisfies 


=< 6(87y"/" + B) 8? — vAy + VBR? 
as long as 
(5.68) C (e'/?y"/4 cs 8) <v. 


9. As long as (5.68) holds, dy/dt < v8?(1 + B) — v Ay. As in (5.30), this gives 
y(t) < max{y(to), 37 (1 + BA, for t > to. 


Thus (5.68) persists as long as C(G(1 + By ae B) < v. Deduce a global 
existence result for the Navier-Stokes equations (5.1) when dim M = 3 and 


O(lluoll sf leo(O) hs + luollae) <v, 
(5.69) 
Olfuollu2 (1+ (1+ BY 4AM) <u. 


For other global existence results, see [Bon] and [Che1]. 
10. Deduce from (5.65) that 


d 


2 3 2 2 
q le Ollze < C(wllzs + llellzsllullz2) — 24 ||Vullze- 


Work on this, applying 


1/2 


1/2 
llwllrs < Cllewl 7.” - lel 


Lé? 
in concert with (5.66). 

11. Generalize results of this section to the case where no extra hypotheses are made on 
Ric. Consider also cases where some assumptions are made (e.g. Ric > 0, or Ric < 0). 
(Hint: Instead of (5.6) or (5.18), we have 

ou = vAu— Pdiv(u®u)+ PBu, Bu = 2v Ric(u).) 

12. Assume u is a Killing field on MM, that is, w generates a group of isometries of M/. 
According to Exercise 11 of § 1, u provides a steady solution to the Euler equation 
(1.11). Show that u also provides a steady solution to the Navier-Stokes equation 
(5.1), provided L is given by (5.4). If M = S 2 or S?, with its standard metric, show 
that such u (if not zero) does not give a steady solution to (5.1) if £ is taken to be 
either the Hodge Laplacian A or the Bochner Laplacian V*V. Physically, would you 
expect such a vector field u to give rise to a viscous force? 

13. Show that a t-dependent vector field u(t) on [0,7) x M satisfying 


u € L'((0,T), Lip'(M)) 
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generates a well-defined flow consisting of homeomorphisms. 
14. Let u be a solution to (5.1) with uo € L?(M), p > n, as in Proposition 5.3. Show 
that, given s € (0, 2], 


l(t) lar <Ct/?, O<t<T. 


Taking s € (1+ n/p, 2), deduce from Exercise 13 that u generates a well-defined 
flow consisting of homeomorphisms. 

For further results on flows generated by solutions to the Navier-Stokes equations, see 
[ChL] and [FGT]. 


6. Viscous flows on bounded regions 


In this section we let be a compact manifold with boundary and consider the 
Navier-Stokes equations on R* x Q, 


a 
(6.1) +V,u=vlu— gradp, divu=0. 


We will assume for simplicity that 2 is flat, or more generally, Ric = 0 on Q, so, 
by (5.4), £ = A. When 00 4 0, we impose the “no-slip” boundary condition 


(6.2) u=0, forx € ON. 
We also set an initial condition 
(6.3) u(0) = uo. 


We consider the following spaces of vector fields on 2, which should be com- 
pared to the spaces V, of (1.6) and V* of (3.4). First, set 


(6.4) V= {ue C(O, TO) : div u =O}. 
Then set 
(6.5) W* = closure of Vin H*(Q,T), k=0,1. 


Lemma 6.1. We have W° = V° and 
(6.6) W? = {ue H3(Q,T) : divu = 0}. 


Proof. Clearly, W° c V®. As noted in § 1, it follows from (9.79)-(9.80) of 
Chap. 5 that 


(6.7) (V°)* = {Vp: pe H'(Q)}, 
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the orthogonal complement taken in L?(,7’). To show that V is dense in V®, 
suppose u € L?(0,7) and (u,v) = 0 for all v € V. We need to conclude 
that wu = Vp for some p € H1(Q). To accomplish this, let us make note of the 
following simple facts. First, 


(6.8) V: H'(Q) + L?(Q,T) has closed range Ro; Ro = ker V* = V°. 
The last identity follows from (6.7). Second, and more directly useful, 


V : L?(Q) + H~'(Q,T) has closed range Ri, 


(6.9) 
Rt = ker V* = {u € HU(Q,T) : divu=0} = WY, 
the last identity defining Ww. 

Now write 2 as an increasing union Q, CC Q2 CC ::- AQ, each Q; having 
smooth boundary. We claim u; = ul, is orthogonal to ws”, defined as in (6.9). 
Indeed, if v € We (and you extend v to be 0 on 2 \ (,), then p(eV—A)v = vz 
belongs to V if p € C§°(R) and ¢ is small, and v- > v in H'-norm if p(0) = 1, 
so (u,v) = lim(u, ve) = 0. From (6.9) it follows that there exist p; € L?(Q;) 
such that u = Vp; on Q;; p,; is uniquely determined up to an additive constant 
(if Q; is connected) so we can make all the p; fit together, giving u = Vp. If 
u € L?(Q,T), p must belong to H1(). 

The same argument works if u € H~!(Q,T) is orthogonal to V; we obtain 
u = Vp with p € L?(Q); one final application of (6.9) then yields (6.6), finishing 
off the lemma. 


Thus, if ua € W!, we can rephrase (6.1), demanding that 


(6.10) (u,v) wo + (Vuu,v) wo = —v(u,v)wi, forallu € V. 


dt 
Alternatively, we can rewrite the PDE as 


(6.11) a +PV,u=—vAu. 

ot 
Here, P is the orthogonal projection of L?(Q,7) onto W® = V°, namely, the 
same P as in (1.10) and (3.1), hence described by (3.5)-(3.6). The operator A is 
an unbounded, positive, self-adjoint operator on W°®, defined via the Friedrichs 
extension method, as follows. We have Ap : W' > (W')* given by 


(6.12) (Agu, v) = (u,v) wi = (du, dv) r2, 
the last identity holding because div u = div v = 0. Then set 


(6.13) D(A) = {ue W?: Aue W}, A=Aolyra)s 
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using W! c W° c (W!)*. Automatically, D(A!/?) = W!. The operator A is 
called the Stokes operator. The following result is fundamental to the analysis of 
(6.1)-(6.2): 


Proposition 6.2. D(A) Cc H?(Q,T). In fact, D(A) = H7(0,T)W!. 
In fact, if w € D(A) and Au = f € W®, then (f,v)z2 = (—Au,v)z2, for 


all v € V. We know Au € H™!, so from Lemma 6.1 and (6.9) we conclude that 
there exists p € L?(Q) such that 


(6.14) —Au= f+Vop. 


Also we know that div u = 0 and u € Hj(Q,T). We want to conclude that 
u € H? and p € H'. Let us identify vector fields and 1-forms, so 


(6.15) —Au=ftdp, du=0, 0. 


Ulan = 


In order not to interrupt the flow of the analysis of (6.1)-(6.2), we will show in 
Appendix A at the end of this chapter that solutions to (6.15) possess appropriate 
regularity. 


We will define 
(6.16) W* = D(A*/?), 5 >0. 
Note that this is consistent with (6.5), for s = k = Oor 1. 


We now construct a local solution to the initial-value problem for the Navier— 
Stokes equation, by converting (6.11) into an integral equation: 


t 
(6.17) u(t) =e A ug — | e(s-HvA p div(u(s) @ u(s)) ds = Wu(t). 
0 


We want to find a fixed point of YW on C(I, X), for J = [0, T], with some T > 0, 
and X an appropriate Banach space. We take X to be of the form 


(6.18) aw = D(A"), 


for a value of s to be specified below. As in the construction in §5, we need a 
Banach space Y such that 


(6.19) ® : X — Y is Lipschitz, uniformly on bounded sets, 
where 


(6.20) ®(u) = P div(u@ u), 
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and such that, for some y < 1, 
(6.21) lle lex) S$ C8, 
for t € (0, 1]. We take 
(6.22) Yow". 


As |le~'4 || ccwo,ws) ~ Ct78/? fort < 1, the condition (6.21) requires s € (0, 2), 
in (6.18). We need to verify (6.19). Note that, by Proposition 6.2 and interpolation, 


(6.23) W* c H°(9,T), forO<s <2. 
Thus (6.19) will hold provided 
(6.24) M : H9(Q,T) > H'(9,T @T), with M(u) =u® u. 


Lemma 6.3. Provided dim Q < 5, there exists sy < 2 such that (6.24) holds for 
all s > So. 


Proof. If dim M = n, one has 
He/2+e ‘ Ar/2te Ee HD/2+e and H/4 : H/4 Cc H® = if, 
the latter because H”/4 C L*. Other inclusions 


1 
(6.25) H"-H" cH’, ratte +66, o=0(sn+e), 


follow by a straightforward interpolation. One sees that (6.24) holds for s > so 
with 


(ifn > 2). 


Nir 


(6.26) ge 
4 
For 2 <n <5, So increases from 1 to 7/4; forn = 6, s9 = 2. 
Thus we have an existence result: 


Proposition 6.4. Suppose dim Q < 5. If 89 is given by (6.26) and ug © W®* for 
some s € (80,2), then there exists T > 0 such that (6.17) has a unique solution 


(6.27) u € C((0,7],W*). 
We can extend the last result a bit once the following is established: 


Proposition 6.5. Set V° = V°N H%(Q,T), for0 < s < 1. We have 
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(6.28) W*=V°*, for0O<s< - 
and hence 

(6.29) P: H°(Q,T) — W*, 
for such s. 


Proof. To deduce (6.29) from (6.28), note that, by (3.5), P : H*(0,T) > 
H*(Q,T) for s = 0,1, hence, by interpolation, for all s € [0,1], so P : 
H*(Q,T) > V*, for s € [0,1]. 
To establish (6.28), recall that W! = D(A!/?) = V°n H4(Q,T). We hence 
have 
Wes (VV rae), ford <2 1. 


Thus (6.28) will follow from the identity 
(6.30) [VV nH, = VON O,7), AO, OXs<1, 


since, as seen in (6.37) of Chap. 4, 

1 
(6.31) [Z7(0), Ae), =A), ford ex 5 
Following [FM], we make use of the following result to establish (6.30): 


Lemma 6.6. There is a continuous projection Q from L?(Q,T) onto V° such 
that Q maps H?(Q,T)N HA(Q,T) = D(A) to H7(Q,T) NW! = D(A). 


Here A is the Laplace operator on 2, with Dirichlet boundary condition. We 
know that 


(6.32) [£7(9,T), H2(Q,T) 9 HAQ,T)1/2 = D((-A)”?) = HA(Q,T), 
so the lemma implies that the projection Q has the property 
(6.33) Q: Hg(Q,T) 3 Wt =V° nN BB(Q,T), 


and (6.30) is a straightforward consequence of this result. 


Proof of lemma. We define the continuous operator Qo : D(A) > D(A) by 
(6.34) Qou=—A7'PAu, we D(A). 


Since Qou = u for u € D(A) = D(A) NV° and since D(A) is dense in V®, it 
suffices to show that Qo can be extended to a bounded operator from L?(, 7’) to 
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V°. Indeed, by the self-adjointness of A and A, we have, for the adjoint, mapping 
V' teL7(0,7), 


(6.35) Qj =—-AtA™, 6: V° Oo 10,7), 


which is a bounded operator from V° to L?(Q,7), since the inclusion . maps 
D(A) into D(A). This proves the lemma, so Proposition 6.5 is established. 


We now return to the integral equation (6.17), replacing Y = W° in (6.22) by 
1 
(6.36) Y=W°=V’, ce [o. 5): 


We take X = W*, as in (6.18), and this time we need s — o € (0, 2) in order for 
(6.21) to hold with y < 1. Higher regularity for the Stokes operator gives 


(6.37) W* c H5(0,T), fors ER, 
extending (6.23). Thus (6.19) will hold provided we extend (6.24) to 
(6.38) M : H°(0,T) — H'*7(Q,T@T), M(u)=u@u. 
Let us write i 5 

c= i s=2+a—d=5 — 20. 
By the arguments used in Lemma 6.3, we have the following: 


Lemma 6.7. Provided dim Q < 6, if 6 € (0,1/2) is small enough, and o = 
1/2 — 6, there exists 89 € (0,2 +c) such that (6.38) holds for all s > 89. 


Proof. If n < 4, then H*(Q) is an algebra for s = 5/2 — 26 if 6 is small enough. 
If n > 5, we can take sy = (n+ 38) /4. 


Thus we have the following complement to Proposition 6.4: 


Proposition 6.8. Suppose dim Q < 6. If 5 > 0 is small enough, s = 5/2 — 26, 
and ug € W®, then there exists T > 0 such that (6.17) has a unique solution in 
C([0,T],W*). 


There are results on higher regularity of strong solutions, for 0 < t < T. We 
refer to [Tem3] for a discussion of this. 

Having treated strong solutions, we next establish the Hopf theorem on the 
global existence of weak solutions to the Navier-Stokes equations, in the case of 
domains with boundary. 


Theorem 6.9. Assume dim Q. < 3. Givenug € W°,v > 0, the system (6.1)-(6.3) 
has a weak solution for t € (0,00), 
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(6.39) ue L(R*,W°) 7 2.(R*, WwW"). 


The proof is basically parallel to that of Theorem 5.6. We sketch the argument. 
As above, we assume for simplicity that 9 is Ricci flat. We have the Stokes oper- 
ator A, a self-adjoint operator on W°®, defined by (6.12)-(6.13). As in the proof 
of Theorem 5.6, we consider the family of projections J: = y(¢A), where x(A) 
is the characteristic function of [—1, 1]. We approximate the solution u by ue, 
solving 


Ouz 


(6.40) at 


+ J-P div(uz ® uz) =—vAuz, uz(0) = Jeug. 


This has a global solution, uz € Cc ([0, oo), Range Je), As in (5.32), {ue} is 
bounded in L°°(R*, L?(Q)). Also, as in (5.33), 


iT 
(6.41) wv | \|Vue(t)|Iz2 dt = || Jeuollz2 — |lue(T)Ilz2, 
0 


for each T € R*. Thus, parallel to (5.34), for any bounded interval J = (0, 7], 
(6.42) {u-} is bounded in L?(I,W'). 
Instead of paralleling (5.36)-(5.39), we prefer to use (6.42) to write 
(6.43) {Vu, Ue} bounded in L1 (I, L°/2(Q)), 
provided dim 2 < 3. In such a case, we also have 
P:W' > #'(9) c 139), 
and hence 
(6.44) PPOs (wy. 


Also {J-} is uniformly bounded on W! and its dual (W')*, and A : Wt > 
(W+)*. Thus, in place of (5.37), we have 


(6.45) {O;u-} bounded in L*(I,(W')*), 
SO 

7 c s 1)* 1 
(6.46) {uz} is bounded in H°(I,(W1)*), Vse (0, 5): 


Now we interpolate this with (6.42), to get, for all 6 > 0, 
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(6.47) {ue} bounded in H*(I,H'~*(Q)), s=s(6) >0, 
hence, parallel to (5.40), 
(6.48) {ue} is compact in L?(I,H!~°(Q)), Vd>0. 


The rest of the argument follows as in the proof of Theorem 5.6. 
We also have results parallel to Propositions 4.9-4. 10: 


Proposition 6.10. Let wu; and uz be weak solutions to (6.11), satisfying 
(6.49) uj EL@U,W)nd,W'), ue L*(rx). 
Suppose dim Q = 2 or 3; if dim Q = 3, suppose furthermore that 
(6.50) uy € LS(I, L4(Q)). 

Tfui(0) = ua(0), then uy = ug onI x Q. 


The proof of both this result and the following are by the same arguments as 
used in § 5. 


Proposition 6.11. [fu is a Leray—Hopf solution and I = [0,T), then 


(6.51) ue L*(IxQ) ifdima=2 
and 
(6.52) we L3(1,14(Q)) ifdima =3. 


Thus we have uniqueness of Leray—Hopf solutions if dim 2 = 2. The follow- 
ing result yields extra smoothness if ug € W?: 


Proposition 6.12. If dim Q = 2, and u is a Leray—Hopf solution to the Navier— 
Stokes equations, with u(0) = ug € W1, then, for any I = [0,T],T < 0x, 


(6.53) ue L@(,W')nL7(1,W?), 
and 

Ou 9 6 
(6.54) aoe (I,W°). 


Proof. Let u; be the approximate solution u- defined by (6.40), with e = e; — 0. 
We have 
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1d 1/2 2 2 
(6.55) Saale uj(t)|| 2 + v||Auj()||i2 = —(Vuj us, Auy) po, 
upon taking the inner product of (6.40) with Au-. Now there is the estimate 
(6.56) |(Vu;uj,Auy)| < CllVujllzs. 


To see this, note that since do (I — P) = 0, we have, for u € W?, 
1 
(Vyu,u)y, = (dV,u,du) = ([d, VyJu, du) + 9 (Vu + Vi] du, du), 


and the absolute value of each of the last two terms is easily bounded by 
f |Vul? av. 

In order to estimate the right side of (6.56), we use the Sobolev imbedding 
result 


(6.57) HQ) c L3(Q), dim =2, 


which implies [|x|] z3 < Cllu||74° lull 422, so 
|Vuslzs < Cl Vullze - Vuyllze 


(6.58) as 
< O'(vd)"*||Vujl|z2 + C'vd||Vujllin- 


We have ||Vuj||71 < C|| Au;||Z2+Cllu,;||7.2, by Proposition 6.2, so if 6 is picked 
small enough, we can absorb the ||Vuw,||7,:-term into the left side of (6.55). We 
get 


ld 1/2 2 Vy . 
os 2a IP s(Dllce + glAusOlis 
< CHAP uj(O) lia + C (lus (Ollés + lull). 


We want to apply Gronwall’s inequality. It is convenient to set 


(6.60) o;() = ||A¥2uj(2)|| 12, OCA) = A442. 
The boundedness of uz in L7.(R*,W') (noted in (6.42)) implies that, for any 
T<o, 
T 
(6.61) | o;(t) dt < K(T) <0, 
0 


with A(T) independent of j. If we drop the term (1/2)|| Au, (t)||7.2 from (6.59), 
we obtain 
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|A°Pus®[p2 < Cos()|AY?uj(t)[[p2 + CO (uj Ollz2 ), 


d 
(6.62) a << 


and Gronwall’s inequality yields 
(6.63) 


t 
2 2 
AM uss SRO [AMA ugls + CKO J O(Ins(sllz2) a 
0 
This implies that u; is bounded in L*° (J, W*), and then integrating (6.59) implies 
u,; is bounded in i, W?). The conclusions (6.53) and (6.54) follow. 


The argument used to prove Proposition 6.12 does not extend to the case in 
which dim 2 = 3. In fact, if dim Q = 3, then (6.57) must be replaced by 


(6.64) HY/2(0) c L3(Q), dim =3, 


which implies |v||z3 < Chole llo| ne and hence (6.58) is replaced by 


3/2 3/2 
[[Vuslzo < Cll Va,R2 - Vuyl34 


(6.65) ie ‘ ; 
< O(v8)~3||Vujll$2 + Cvd||Vujll3q. 


Unfortunately, the power 6 of ||Vuw,||z2 on the right side of (6.65) is too large in 
this case for an analogue of (6.60)—(6.63) to work, so such an approach fails if 
dim 2 = 3. 

On the other hand, when dim 2 = 3, we do have the inequality 


d V 
qari Ollzs a 5 llAuy Ollze 


< OAM? a; ()Ilb2 + C (les (OllL2 + lus Ize). 


= 
(6.66) 2 


We have an estimate ||w,;(t)||;2 < A, so we can apply Gronwall’s inequality to 
the differential inequality 


saYilt) SC¥i(H) + OU + K?) 


to get a uniform bound on Y;(t) = ||A‘/?u;(¢)||?.2, at least on some interval 


(0, To]. Thus we have the following result. 


Proposition 6.13. [f dim Q = 3, and u is a Leray—Hopf solution to the Navier— 
Stokes equations, with u(0) = uo € W', then there exists Ty = To(||uol| w2) > 0 
such that 

(6.67) u€ L™([0, To], W*) N L?([0, To], W7), 


and 
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(6.68) — € L?([0, To], W°). 


Note that the properties of the solution u on [0, To] x © in (6.67) are stronger 
than the properties (6.49)-(6.50) required for uniqueness in Proposition 5.10. 
Hence we have the following: 


Corollary 6.14. If dim Q = 3 and uy and uy are Leray—Hopf solutions to the 
Navier-Stokes equations, with uo(0) = u2(0) = uo € W1, then there exists 
To = To(||uollw1) > 0 such that uy (t) = u2(t) for0 < t < To. 

Furthermore, if u € W®* with s € (80,2) as in Proposition 5.4, then the 
strong solution u € C((0,T],W*) provided by Proposition 6.4 agrees with any 
Leray—Hopf solution, for 0 < t < min(T, Tp). 


As we have seen, a number of results presented in § 5 for viscous fluid flows 
on domains without boundary extend to the case of domains with boundary. We 
now mention some phenomena that differ in the two cases. 

The role of the vorticity equation is altered when 0 # Q. One still has the 
PDE for w = curl u, for example, 


an +Vyw =vAw (dim 2 = 2), 
(6.69) - 
op + Vuw — Vou = vAw (dim 2 = 3), 


but when 0Q # 9, the initial value w(0) alone does not serve to determine w(t) 
for ¢ > 0 from such a PDE, and a good boundary condition to impose on w(t, x) 
is not available. This is not a problem in the v = 0 case, since wu itself is tangent to 
the boundary. For v > 0, one result is that one can have w(0) = 0 but w(t) 4 0 
for t > 0. In other words, for v > 0, interaction of the fluid with the boundary 
can create vorticity. 

The most crucial effect a boundary has lies in complicating the behavior of 
solutions u, in the limit » — 0. There is no analogue of the v-independent 
estimates of Propositions 4.1 and 4.2 when 0Q. 4 §. This is connected to the 
change of boundary condition, from u,|9q = 0 for v positive (however small) to 
n-ulag = 0 when v = 0, n being the normal to OQ. Study of the small-v limit 
is important because it arises naturally. In many cases flow of air can be modeled 
as an incompressible fluid flow with vy ~ 10—°. However, after more than a cen- 
tury of investigation, this remains an extremely mysterious problem. See the next 
section for further discussion of these matters. 


Exercises 


1. Show that D(A*) c H?*(Q,T), for k € Z*. Hence establish (6.37). 

2. Extend the L?-Sobolev space results of this section to L?-Sobolev space results. 

3. Work out results parallel to those of this section for the Navier-Stokes equations, when 
the no-slip boundary condition (6.2) is replaced by the “slip” boundary condition: 
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(6.70) 2v Def(u)N —pN =0 on OQ, 


where NV is a unit normal field to OQ and Def(w) is a tensor field of type (1,1), given 
by (2.60). Relate (6.70) to the identity 


(vLu — Vp,v) = —2v (Def u, Def v) whenever div v = 0. 


7. Vanishing viscosity limits 
In this section we consider some classes of solutions to the Navier-Stokes equa- 


tions 


Ou” 
Ot 


(7.1) +Vyu" + Vp’ =vAu" +F”’, divu” =0, 


on a bounded domain, or a compact Riemannian manifold, Q (with a flat metric), 
with boundary OQ, satisfying the no-slip boundary condition 


(7.2) w lat xaa = 9 
and initial condition 
(7.3) u” (0) = uo, 


and investigate convergence as v —> 0 to the solution to the Euler equation 
(7.4) —+V,0u°+Vp° =F, divu® =0, 
with boundary condition 
(7.5) u® || AQ, 
and initial condition as in (7.3). We assume 
(7.6) divuo, uo || 0Q, 
but do not assume wp = 0 on 02. 
When 00 # Qj, the problem of convergence u” —> u° is very difficult, and 
there are not many positive results, though there is a large literature. The enduring 
monograph [Sch] contains a great deal of formal work, much stimulated by ideas 


of L. Prandtl. More modern mathematical progress includes a result of [Kt7], that 
u”(t) > u(t) in L?-norm, uniformly in t € [0,7], provided one has an estimate 


0 
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T 
(7.7) vf / |\Vu" (t, x)|? dxdt —+0, as vy 30, 
0 
r 


cv 


where D5 = {a € Q : dist(z,0Q) < 6}. Unfortunately, this condition is not 
amenable to checking. In [W] there is a variant, namely that such convergence 
holds provided 


T 
(7.8) vf / |\Vru” (t,x) |? dxdt —+ 0, as v +0, 
0 


a) 


with 7(v)/v — oo as vy — 0, where Vr denotes the derivative tangent to 0. 

Here we confine attention to two classes of examples. The first is the class of 
circularly symmetric flows on the disk in 2D. The second is a class of circular 
pipe flows, in 3D, which will be described in more detail below. Both of these 
classes are mentioned in [W] as classes to which the results there apply. How- 
ever, we will seek more detailed information on the nature of the convergence 
u” — u®. Our analysis follows techniques developed in [LMNT, MT1, MT2]. 
See also [Mat, BW, LMN] for other work in the 2D case. Most of these papers 
also treated moving boundaries, but for simplicity we treat only stationary bound- 
aries here. 

We start with circularly symmetric flows on the disk Q = D = {x € R?: 
|x| < 1}. Here, we take F” = 0. By definition, a vector field wo on D is circularly 
symmetric provided 


(7.9) uo( Rox) = Roug(xz), Va e D, 


for each 0 € [0,27], where Rg is counterclockwise rotation by 6. The general 
vector field satisfying (7.9) has the form 


(7.10) so(|x|)a+ + s1(|a|)x, 


with s; scalar and c+ = Jx, where J = R,/2, but the condition div ug = 0 
together with the condition wo || OD, forces s; = 0, so the type of initial data we 
consider is characterized by 


(7.11) uo(x) = so(\a|)a+. 


It is easy to see that divug = O for each such uo. Another characterization of 
vector fields of the form (7.11) is the following. For each unit vector w € S$! C 
R?, let ®,, : R? —> R? denote the reflection across the line generated by w, i.e., 
®(aw + bJw) = aw — bJw. Then a vector field wo on D has the form (7.11) if 
and only if 


(7.12) uo(®,2) = —®,up(z), Vwe S', xe D. 
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A vector field uo of the form (7.11) is a steady solution to the 2D Euler equation 
(with F° = 0). In fact, a calculation gives 


(7.13) Vuotto = —80(|z|)2a2 = —Vpo(2), 
with 
1 
(7.14) po(2) = Bo(|z!),  Folr) =— / s0(p)2p dp. 


Consequently, in this case the vanishing viscosity problem is to show that the 
solution wu” to (7.1)-(7.3) satisfies u” (t) + up as v + 0. The following is the key 
to the analysis of the solution wu”. 


Proposition 7.1. Given that uo has the form (7.11), the solution u” to (7.1)-(7.3) 
(with F” = 0) is circularly symmetric for each t > 0, of the form 


(7.15) u’(t,2) = 9" (t, |z))a~, 


and it coincides with the solution to the linear PDE 


Ou” 
Ot 


(7.16) =vAu’, 


with boundary condition (7.2) and initial condition (7.3). 


Proof. Let u” solve (7.16), (7.2), and (7.3), with uo as in (7.11). We claim (7.15) 
holds. In fact, for each unit vector w € R?, —®,,u”(t, ®,,) also solves (7.16), 
with the same initial data and boundary conditions as u”, so these functions must 
coincide, and (7.15) follows. Hence div u” = 0 for each t > 0. Also we have an 
analogue of (7.13)-(7.14): 


Vuru" = Vp, p’ (t,x) = p(t, |x|), 


(7.17) 1 
p’(t,r) = -{ s”(t,p)pdp. 


Hence this u” is the solution to (7.1)—(7.3). 

To restate matters, for Q = D, the solution to (7.1)-(7.3) is in this case simply 
(7.18) u’ (t, 2) = e”*4uo(a). 
The following is a simple consequence. 


Proposition 7.2. Assume uo, of the form (7.11), belongs to a Banach space X of 
R?-valued functions on D. If {e'“ : t > 0} is a strongly continuous semigroup 
on X, then u’(t,-) 4 ug in ¥ as v — 0, locally uniformly in t € {0, 00). 


628 17. Euler and Navier-Stokes Equations for Incompressible Fluids 


As seen in Chap. 6, {e’“ : ¢ > 0} is strongly continuous on the following 
spaces: 


(7.19)  L(D), 1<p<o, C(D)={f €C(D): f =0on dD}. 


Also, it is strongly continuous on D, = D((—A)*/?) for all s € R*+. We recall 
from Chap. 5 that 


(7.20) Dz = H?(D)NHg(D), Di = Ho(D), 
and 
(7.21) Dy = ([L°(D), AyD) \s5. O0< e <1: 


In particular, by interpolation results given in Chap. 4, 


1 
(7.22) D,=H*(D), 0<s< ei 
We also mention Proposition 7.4 of Chap. 13, which implies this heat semigroup 
is strongly continuous on 


(7.23) Cy(D) ={f € C'(D): f =0 on AD}. 


On the other hand, if uo € C(D) but does not vanish on OD, then e’“ug does not 
converge uniformly to uo on D, as t + 0, though as shown in Corollary 8.2 of 
Chap. 6, we do have convergence of e’“ug to uo uniformly on compact subsets 
of D. Thus there is a boundary layer attached to 0D where uniform convergence 
fails. We recall Proposition 8.3 of Chap. 6 in the current context. 


Proposition 7.3. Given uo € C%°(D), we have, as v \, 0, locally uniformly in 


t € Rt, 
+ k 
et Aug ~ uo(x) + S- WA" nhuo(a) 
(7.24) ve 
— $7 2b, («) (4vt)9/?E; (SS). 
Vv 


j20 


Here, b; € C*(D), p(x) = dist(x, 0D) = 1 — |z|, and the special functions 
E,(y) are given by 


(7.25) E,(y) = — ie e~* (s—y) ds. 
y 


We mention that bb) = uo on OD and b;\apn = 0 for j odd. Also, Eo(0) = 1/2. 
The primary “boundary layer” term is 
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l= 
(7.26) ~2u9(z) Eo( a ). 


and we see the boundary layer thickness is ~ /4vt. 

We pass from this class of 2D problems to the following class of 3D problems. 
We look for solutions to (7.1)-(7.3) with uw” = u’(t,x,z),p” = p’ (t,x, 2), 
(t,2,z) € Rt x Q, where Q = D x R, D being the 2D disk as above. Thus 
Q. is an infinitely long circular pipe. In this case, we consider external force fields 
of the form 


(7.27) Fv(t,@, 2) = (0, f" (4), 


so F’” is parallel to the z-axis, with z-component f”(t). We take initial data of the 
following form: 


(7.28) u" (0, , 2) = uo(x) = (vo(2), wo(x)), 


where vo is a vector field on D and wg is the z-component of wo. We require the 
conditions 


(7.29) divuo = 0, up ||/OQ, t.e., divug =0, v9 || OD, 


and we require the vector field vp on D to be circularly symmetric, so, as in (7.11), 
vo(x) = so(|a|)a+, hence 


(7.30) ug (x) = (so(|x|)a~, wo(a)). 


The fact that Q is infinite is inconvenient. To get the theoretical treatment 
started, it is convenient to modify the set-up by requiring that solutions be periodic 
(say of period L) in z, so we replace Q by 0; = D x (R/LZ). In such a case, 
results of § 6 imply that, for each v > 0, (7.1)-(7.3) has a unique strong, short time 
solution, given mild regularity hypotheses on vp(a) and wo() (a solution that, as 
we will shortly see, persists for all time ¢ > 0 under the current hypotheses), and 
the solution is z-translation invariant, i.e., 


(7.31) w= (UP bm )GR0" (Ej), p’ = p’ (t, 2). 
Consequently, 
(7.32) Vuru’ = (Vyrv", Vy w”),  divu’ = divv”. 


Hence, in the current setting, (7.1) is equivalent to the following system of equa- 
tions on Rt x D: 


Ov” 


(7.33) OE +V pu" + Vp" =vAv", divv’” =0, 
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Ow” 


Ot 


(7.34) + Vyw" =vAw" + f”. 
Note that (7.33) is the Navier-Stokes equation for flow on D, which we have 
just treated. Given initial data satisfying (7.30), we have 


(7.35) uv" (t, 2) = e”*4.u9(a), 


where A is the Laplace operator on D, with Dirichlet boundary condition. The 
results of Proposition 7.2, complemented by (7.19)-(7.23) apply, as do those of 
Proposition 7.3, taking care of (7.33). 

It remains to investigate (7.34). For this, we have the initial and boundary con- 
ditions 


(7.36) w” (0) = wo, 0, 


tO | cag = 


and we ask whether, as v \, 0, w” converges to w®, solving 


Ow? 0 0 0 
(7.37) PE +V pw =f (t), w (0) =wo. 
We impose no boundary condition on w®, which is natural since v° = vo is tan- 


gent to OD. 

Before pursuing this convergence question, we pause to observe a class of 
steady solutions to (7.33)-(7.34) known as Poiseuille flows. Namely, given a € 
R\ 0, 


(7.38) uo(x) = a(0,1 — |a|”) 

is such a steady solution, with 

(7.39) p’(t,7) =0, f”’(t) = (0,4va). 

An alternative description is to set 

(7.40) p(t, 2,2) =—4vaz, f’(t) =0. 

This latter is acommon presentation, and one refers to Poiseuille flow as “pressure 
driven” However, this presentation does not fit into our set-up, since we passed 
from the infinite pipe D x R to the periodized pipe D x (R/LZ), and p” in (7.40) 
is not periodic in z. These Poiseuille flows do fit into our set-up, but we need to 


represent the force that maintains the flow as an external force. 
We return to the convergence problem. For notational convenience, we set 


(7.41) Xy = Vow = 8"(t, |x|) ¢ 


0 
70 X=V,0 so(lal) a5. 
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Thus we examine solutions to 


Ow” 
(7.42) at 
w’ (0,2) = wo(2), Cg en = 0, 


= vAw" — X,w" + f’ (6), 


compared to solutions to 


0 
(7.43) — = —Xw® + f(t), w°(0,2) = wo(z). 


We do not assume wo|ap = 0. In order to separate the two phenomena that make 
(7.42) a singular perturbation (7.43), namely the appearance of vA on the one 
hand and the replacement of X by X, on the other hand, we rewrite (7.42) as 


Ow” 


(7.44) 7 


= (vA— X)w" + (X — X,)w”’ + f’ (2), 


and apply Duhamel’s formula to get 


t 
(7.45) w"(t) = Ay | el s\VA-X)(X _ X,)w"(s) + f’(s)] ds. 
0 


By comparison, we can write the solution to (7.43) as 


(7.46) w(t) = e * wy + | f°(s) ds. 
Consequently, 
(7.47) w(t, 2) — w(t, x) = Ri(v,t, x) + Ro(v,t, 2) + R3(v,t, 2), 
where 
Ry(v,t, x2) = QOS ig eo ap, 


t 
74g) Palortea) =f [pr(syet4-1 — $°(s)] as, 


t 
R3(v,t, x) =| elt-s(PA=X) (gr OU a 
9 oo 


The term Rg is the easiest to treat. By radial symmetry, 


elt-s)(VA-X) l= gore, 
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and we can write 
Ra(vst2) = [[P(s)- M6)] 4 


+f f’(s)[e@-941 — 1] ds. 


(7.49) 


The uniform asymptotic expansion of the last integrand is a special case of (7.24): 


(7.50) e941 1 N° 2; (2) (V(t - s))/?E,( 
j20 


with b; € C°(D), bolap = 1, and E; as in (7.25). The principal contribution 


giving the boundary layer effect for the term Ro(v,t, x) is 


- a 
(7.51) 2 fr F’( s)Eo( Tana)” 


Methods initiated in [MT1] and carried out for this case in [MT2] produce 
a uniform asymptotic expansion for Ri(v,t, 2) almost as explicit as that given 
above for Rz, but with much greater effort. Here we will be content to present 


simpler estimates on ,. Our analysis of 


(7.52) WY’ (t, 2) = e&”4-*) wo (ar) 


starts with the following. Recall that X is divergence free and tangent to OD. 


Lemma 7.4. Given v > 0, 

(7.53) D((vA — X)) =D(A4), 7 =1,2. 
Proof. We have, for v > 0, 

(7.54) DvA—X)={f € H?(D): fly, =}, 


and 


(7.55) D((vA-X)?)={f € H*(D): fp, =vAf-Xflap = 0} 


The first space is clearly equal to D(A). Since X is tangent to OD, flap 
0 + X flap = 0, so the second space coincides with {f € H*(D) : 


A flap = 0}, which is D(A?). 


Remark: The analogous identity of domains typically fails for larger 7. 


4v(t — s)/’ 
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To proceed, since W” in (7.52) satisfies 0,;W” = —XW” + vAW’”, we can 
use Duhamel’s formula to write 


t 
(7.56) W(t) = eX wo + » | e (9) X AW" (s) ds, 
0 
hence 
t 
(7.57) jet 4 - ang — e** woz» < vf AW” (s)||z> ds. 
0 


The following provides a useful estimate on the right side of (7.57) when p = 2. 


Lemma 7.5. Take wo € D(A?) = D((vA — X)?), and construct W” as in 
(7.52). Then there exists K € (0,00), independent of v > 0, such that 
(7.58) JAW” (t)|[Z2 < e?**||Awollz2- 


Proof. We have 


SAW (yl: = 2Re(Aa,W", AW”) 
= 2Re(vA?W”, AW”) — 2Re(AXW”, AW”) 


(7.59) < —2Re(AXW”, AW”) 
= —2Re(XAW”, AW”) — 2Re([A, X]W”, AW”) 
< 2K AW" 72, 


with i independent of v. The last estimate holds because 
(7.60) g € D(A) = |(Xg,9)| < Kallgllie, 


and 


(7.61) 
W(t) € D(A?) = [A, X]W”(t) € L?(D), and 


A, X]W"()|Iz2 < Kol|W"()|lz2 < Kol|AW” )|Iz2- 
The estimate (7.58) follows. 


We can now prove the following. 
Proposition 7.6. Given p € [1,00), wo € L?(D), we have 
(7.62) Ong — ewe, aru \, 0, 


with convergence in L?-norm. 
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Proof. We know that e’”“ is a contraction semigroup on L?(D) and e~'* is a 
group of isometries on L?(D), and we have the Trotter product formula: 


(7.63) &VA—X)ang = lim (elt Ae UM) X) "9, 


in L?-norm, hence e’(”4—*) is a contraction semigroup on L?(D). By (7.58) and 


(7.57), we have L? convergence for wo € D(A”), which is dense in L?(D). This 
gives (7.62) for p = 2, by the standard approximation argument, a second use of 
which gives (7.62) for all p € [1, 2]. 

Suppose next that p € (2,00), with dual exponent p’ € (1,2). The previous 
results work with X replaced by —X, yielding e”4*+%) g + e'*g, as v \, 0, in 
L”’ -norm, for all g € L?’(D). This implies that for wo € L?(D), convergence in 
(7.62) holds in the weak* topology of L?(D). Now, since e~'* is an isometry on 
L?(D), we have 
vA—X) 


(7.64) \le~** wollz» > limsup |e“ wollz, 
v—-0 


for each wo € L?(D). Since L?(D) is a uniformly convex Banach space for 
such p, this yields L?-norm convergence in (7.62). 


To produce higher order Sobolev estimates, we have from (7.58) the estimate 
(7.65) je wo || Dray < eX wollpca), 


first for each wo € D(A?), hence for each wo € D(A). Interpolation with the 
L?-estimate then gives 


(7.66) leo“ 49 wo l>(—ays/2y S e**||wolld(—ays/2)s 


for each s € [0,2], wo € D((—A)*/?) = D,. As noted in (7.22), D, = H*(D) 
for 0 < s < 1/2, so we have 


1 
(7.67) |e A- wo ll e(p) < Ce**||wollus(p), OS 8 < 5 


with C and K independent of v € (0, 1]. We can interpolate the estimate (7.67) 
with 


(7.68) le woll ecw)  llwollze(p), 1 <p <x. 
Using 
1 1-6 @ 
(7.69) H*(D), L?(D)|¢ = H°-9*9) (D), = an 
[H*(D), L?(D)]o (D) (0) TS 
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which follows from material in Chap. 13, § 7, we have 


(7.70) Ije6 4) wo || zxe.4(D) < Co,qe™*||w|| z2.<(D); 
valid for 
(7.71) 2<q<o, oqé (0,1). 


Similar arguments give such operator bounds on e~'*. We have the following 
convergence result. 


Proposition 7.7. Let o, q satisfy (7.71). Then, for each t € (0,00), 


(7.72) Wo € H™4(D) => lim BS hig, _ e 'Xwo, 


v—0 


in H®4-norm. 


Proof. Given wo € H%4(D), (7.70) implies {e’4-) wo : v € (0,1]} is 
bounded in H7-4(D) for each t € (0,00), so there is a weak” limit point. But 
Proposition 7.6 yields convergence to e~'* wo in L4-norm, so e~'* wo is the only 
possible weak* limit point. Norm convergence in H7’“(D), for each r < a, then 
follows from the compactness of the inclusion H%4(D) — H™4(D). Taking 
o' > o such that o’q < 1, the argument above yields e“’4-*) wy + e~'* wo 
in H°:4-norm for each wo € H @9(D), The conclusion follows by denseness of 
H?"-4(D) in H®4(D), plus the uniform operator bound (7.70). 


This concludes our treatment of Ri(v,t,x). As mentioned, more precise 
results, including boundary layer analyses, are given in [MT1] and [MT2]. 
We move to an analysis of R3(v, t, 2) in (7.48), ie., 


d 
a0 ° 


t 
. O 
(7.73) R3(v,t, 2) iy elt-s)YA-X) (5. _ gv 
0 
where w” solves (7.34) and (7.36). Note that 0/00 commutes with X, X,, and 
A, so 2” (t,x) = Ow” /00 solves 


Oz” 


(7.74) = 


= (vA—X,)z", 2” 


The maximum principle gives 


Ie me 


(7.75) a (3) cco Ss | 00 liens 


Since the semigroup e’(”4—~*) is positivity preserving, we have 
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t 
(1.76) |Ra(ost,2)| < [owolla~ fe 92-™ |so( lal) ~ 8”(s lel) ds 
0 


Also, by radial symmetry, 


(7.77) gir 2 VARS) 3 = s”| = Po cmaa call — 3 , 
so 
t 
(7.78) Relate = Nejawellns | evt-IA/ 5. — 3/ ds, 
0 
where 
(7.79) 80(x) = so(|2|), a” (t, i) = a(t; |x|), 


and, we recall from (7.41), 
(7.80) v(t, 2) = s’(t, |a|)z+, vo(x) = so(|x|)at. 
Turning these around, we have 
y 1 Vv al 1 al 
(7.81) s*(t,|z|) = lee” (t,2)-2~,  89(|2|) = eo”) a, 
and also, if {e1, e2} denotes the standard orthonormal basis of R?, 
Vv 1 Vv 
§ (t,r) =U (t,re1) *€2 
1 
(7.82) = a ——v" (t, roe) - eg da 
0 oO 
1 
-| €2+ Ve,v" (t, rae) do, 
0 


and similarly 


1 
(7.83) so(r) = | €2 + Ve, vo(t, roe1) do. 
0 


The representation (7.81) is effective away from a neighborhood of {x = 0}, 
especially near 0D, where one reads off the uniform convergence of s”(t,7) to 
So(7) except on the boundary layer discussed above in the analysis of v’(t,-) > 
vo(-), given v9 € C*(D). 

The representation (7.82)-(7.83) is effective on a neighborhood of {x = 0}, 
for example the disk D,/2 = {x € R? : || < 1/2}, and it shows that s”(t,r) > 
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so(r) uniformly on r < 1/2 provided v’(t,-) —> vo(t,-) in C!(Dy,2). Results 
from Chap. 6, § 8 (cf. Propositions 8.1-8.2) imply one has such convergence if 
vg € C1(D), and in particular if vp € C°(D). 

Furthermore, the maximum principle implies 
(7.84) oA sg oe" |< hie * lg 8 |, 
where h,, is the free space heat kernel, given (with n = 2) by 


(7.85) hii (x) = (Anvt)~"/2e— lel? /avt 


and |S  — 8”| is extended by 0 outside D. We hence have the following boundary 
layer estimates on Rs. 


Proposition 7.8. Assume vo, wo € C®(D). Then, given T € (0,00), we have a 
uniform bound 


(7.86) |R3(v, t, z)| < C, 
fort € [0,7], v € (0,1), x € D. Furthermore, as v + 0, 
(7.87) R3(v,t,x) > 0 uniformly on DU Gis 


as long as 


w(v) 
7 


We recalls = {x € D: dist(x,0D) < 6}. 


(7.88) — oo. 


Among other results established in [MT1]-[MT2], we mention one here. For 
k EN, set 


(7.89) 
VA(D) ={f € L°(D): Xj,--+Xy,f € L?(D), VE< k, X;,, € £'(D)}, 


where X!(D) denotes the space of smooth vector felds on D that are tangent to 
OD. After establishing that 


(7.90) fev'(D) => lim e4f¢ =f, in V*-norm, 
and 
(7.91) feve(D) => lim, etVA-X) ¢ — eo *X # in V*norm, 


these works proved the following (cf. [MT2], Proposition 3.10). 
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Proposition 7.9. Assume vg € C%°(D) and wo € C'(D). Take k € N and also 
assume wo € V*(D). Then, for each t > 0, as v \, 0, 


(7.92) v’(t,-) 3 v9 and w"(t,-) > w(t,-), in V*-norm. 


Such a result is consistent with Prandtl’s principle, that in the boundary layer 
it is normal derivatives of the velocity field, not tangential derivatives, that blow 
up as v —> 0. We mention that the convergence of R2(v,t,x) to 0 in V*-norm 
follows from the analysis described in (7.50), and the convergence of R,(v, t, x) 
to 0 in V*-norm, given wo € C%(D), follows from a parallel analysis, carried 
out in [MT2], but not here. The convergence of R3(v, t, x) to 0 in V*-norm, given 
U0, Wo € C% (2), does not follow from the results on Rg established here; this 
requires further arguments. 

For further work on boundary layer theory, with an emphasis on channel flows 
and pipe flows, see [MNW], [HMNW], and [MM]. 

The two cases analyzed above are much simpler than the general cases, which 
might involve turbulent boundary layers and boundary layer separation. Another 
issue is loss of stability of a solution as v decreases. One can read more about such 
problems in [Bat, Sch, ChM, OO], and references given there. We also mention 
[VD], which has numerous interesting illustrations of fluid phenomena, at various 
viscosities. 


Exercises 


1. Verify the characterization (7.12) of vector fields of the form (7.11). 
2. Verify the calculation (7.13)-(7.14). 
3. Produce a proof of (7.90), at least for k = 1. Try for larger k. 


8. From velocity field convergence to flow convergence 


In §7 we have given some results on convergence of the solutions u” to the 
Navier-Stokes equations 


oe +V yu + Vp" =vAu” on IxQ, 
(8.1) at 
div u” = 0, ti | es =0, u”(0) = U0, 


to the solution wu to the Euler equation 


Ou 
Ot UU = Q, 
(8.2) rae u+Vp=0 onlIx 


divu=0, w|/0Q, u(0) = uo, 
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as \, 0 (given div up = 0, Uo || 02). 

We now tackle the question of what can be said about convergence of fluid 
flows generated by the t-dependent velocity fields wu” to the flow generated by 
u. Given the convergence results of § 7, we are motivated to see what sort of flow 
convergence can be deduced from fairly weak hypotheses on u, uw”, and the nature 
of the convergence u” — u. We obtain some such results here; further results can 
be found in [DL]. 

We will make the following hypotheses on the t-dependent vector fields 
u and u”. 


(8.3) u € Lip([0, 7] x Q),  divu(t)=0, u(#) || 09, 
(8.4) u’ € Lip([e,T] xO), Ve > 0, divu’(t)=0, u(t) || 0g, 
(8.5) u” € L*([0,T] x Q). 
Say v € (0, 1]. Here Q is a smoothly bounded domain in R”, or more generally 
it could be a compact Riemannian manifold with smooth boundary 02. We do not 


assume any uniformity in v on the estimates associated to (8.4)-(8.5). 
The field wu defines volume preserving bi-Lipschitz maps 


(8.6) gp? :0 30, s,¢ € [0,7], 
satisfying 

0 t,s t,s 8,8 
(8.7) g(x) =ult,p"(x)), v*(a) =a. 


at 


Similarly the fields w” define volume preserving bi-Lipschitz maps 


(8.8) pet: 030, s,t€ (0,T], 
satisfying 

(89) Sia) = we oh"(@)), h(a) =a. 
Note that 


Rae gop? =p", 1,s,t € [0,T], 
, pio ph? = gh", r,s,t € (0,T). 


Our convergence results will be phrased in terms of strong operator conver- 
gence on L?(Q) of operators S%° to S“°, where 
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S* fo(x) = foly*"(x)), s,¢ € [0,7], 


(8.11) t,s st 
SY fo(2) = foler (a), 8,t€ (0,7), 


and S®° (as well as y®') will be constructed below. These operators are also 
characterized as follows. For f = f(t, x) satisfying 


0 
(8.12) or oe —Vuty f(t), te (0, T], 
we set 
(8.13) S* f(s) = f(t), s,t € [0,7]. 


Note that f is advected by the flow generated by u(t). Clearly 


(8.14) 
S's: L?(Q) —+ L?(Q),  isometrically isomorphically, Vs, t € [0,7]. 


Similarly, for f” = f”(t, x) solving 


of” 
a — v Ei 
(8.15) ai Vue f" (Ot), 
we set 
(8.16) So? f"(s) = f’(t), s,¢ € (0,7), 
and again 
(8.17) 
Sis: LP(Q) —+ L?(Q), — isometrically isomorphically, V s,t € (0,7). 

Note that 

SS = Sra be |0; 7), 
(8.18) 


SH9Se" = St r,s, € (0,7). 


We will extend the scope of (8.16) to the case s = 0. Then we will show that, 
given (8.3)-(8.5), p € [1,00), t € [0,7], fo € L*(Q), 


u” > u in L}((0,T], L7(Q)) 


8.19 
vee => S*° fy > Sb? fo in L?-norm. 


In light of the relationship 


(8.20) Sy” fo(z) = fo(ye"(x)), 
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which will be established below, this convergence amounts to some sort of con- 
vergence 


(8.21) oy — yo" 


for the backward flows yp". 
To construct S%° on L?(Q), we first note that 


€,6 S (0,7), fo € Lip(Q) 


(8.22) a ; 
=> ||S2* fo — follnee < ||u" zl folluiple — 4], 


which in turn implies 


|Sé° fo — SE folln~ = ||S2° (SE? fo — fo)lln~ 
(8.23) = ISS” fo — follt- 


< |lu’||z~ |] folluip « le — 4]. 
Hence 


: te — et,0 
(8.24) ie SF fo — oy fo 


exists for all fo € Lip(Q), convergence in (8.24) holding in sup-norm, and a 
fortiori in L?-norm. The uniform boundedness from (8.17) then implies that (8.24) 
holds in L?-norm for each fo € L?(Q), as long as p € [1, 00), so Lip(Q) is dense 
in L?(Q). This defines 

(8.25) Se: LP(Q) 3 LQ), 1<p<oo, te (0,7), 


and we have 


(8.26) Si? folln» = lim ||SP* follze = || follz», 
E\,0 


so S%° is an isometry on L?(Q) for each p € [1, 00). 
We note that, parallel to (8.22), for c,d € (0,T], 7 € Q, 


(8.27) dist(y?* (x), ) < |lu"|| z= - le — 4], 
and, parallel to (8.23), if also t € (0, T], 


dist(yi*(x), o)"(x)) = dist( ys? (y"(«)), y?"(a)) 


(8.28) i 
< |lu" lz - |e — 4]. 


It follows that 
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(8.29) eo" (a) = lim g"(2) 
E\O 


exists and y?* :  — ( continuously, preserving the volume. Furthermore, we 
have 


(8.30) Sy fol) = fol yr" (a), 
first for fo € Lip(Q), then, by limiting arguments, for all fg € C(Q), and fur- 
thermore, for all fo € L?(Q), in which case (8.30) holds, for each t € (0, 7], for 


a.e. 2, i.e., (8.30) is an identity in the Banach space L?(Q). 
We derive some more properties of St0, Note from (8.23)-(8.24) that, when 


fo € Lip(Q), ¢ € (0,7), 
(8.31) 32° fo — SE foll zc < ||u” ||| folluip -, 


and hence, by uniform operator boundedness, for each fo € L?(Q), p € 
[1,00), s,¢ € (0,7), 


SP fo = lim Sé* fo (in L?-norm) 

éE\,0 
(8.32) = lim S'’*S5* fy (by (8.18)) 

éE\,0 

= 87°85 fo. 

Hence, S3*S*%° = S%° on L?(Q), Vs,t € (0, T], or equivalently, 
(8.33) Shtgi® — $99 on LP(M). 
We also have from (8.23)—(8.24) that 
(8.34) fo € Lip(Q) = ||S7° fo — folla~ < llu”Ilz~llfollzip 8 
and hence, again by uniform operator boundedness and denseness of Lip(Q), 


(8.35) lim S>" fy = fo in L?-nom, V fo € L?(Q). 


We now want to compare S“° fo with S!° fo. To begin, take 
(8.36) fo € Lip(Q), f(t) = 8" fo. 
Then f(t) satisfies 


) 
83) P= Ve fl) + Vurw-un fl, 10) = fo 
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so Duhamel’s formula gives 


t 

(8.38) fQ) = 87° fot | SEV una adeay (8) ae: 
0 

Now, by hypothesis (8.3) on u, we see that, for s € [0, T], 


fo € Lip(Q) = Il f(s)|luip < All follip 


(8.39) v 
= |Vur(s)-u(s) f(8)| < All folluip|u”(s) — u(s)]. 
Hence 
fo€ Lip(Q) => |S°° fo _ So follze 
t 
(8.40) < | ]S5° Vw (s)—u(syf (8) || be ds 


t 
< Allfolltip | lu" (a) — ule) lla» ds. 


Hence, given p € [1, 00), t € [0,T], 


u’ >u in L*((0,T], L?(Q)) 


8.41 
— => S*° fy > S*° fy in L?-norm, 


for all fo € Lip(Q), and hence, by the uniform operator bounds (8.26) and dense- 


ness of Lip(Q) in L?(Q), we have: 


Proposition 8.1. Under the hypotheses (8.3)-(8.5), given p € [1, 00), t € [0,T], 
convergence in (8.41) holds for all fo € L?(Q). 


In fact, we can improve Proposition 8.1, as follows. (Compare [DL], 
Theorem II.4.) 


Proposition 8.2. Given p € (1,00), t € [0,T], 


coe fo € L?(Q), u” > u in L*([0,T], L'(Q)) 
——s So" fy > St fy in L®-norm. 

Proof. By Proposition 8.1, the hypotheses of (8.42) imply 

(8.43) Si fo + S”” fo 


in L'-norm. We also know that 


(8.44) Ise? follne = ||S"° follze = Ilfollze. 


644 17. Euler and Navier-Stokes Equations for Incompressible Fluids 


for each vy € (0, 1], ¢ € [0, 7]. These bounds imply weak” compactness in L?(Q), 
and we see that convergence in (8.43) holds weak” in L?(Q). Then another use of 
(8.44), together with the uniform convexity of L?(Q) for each p € (1, 00) gives 
convergence in L?-norm in (8.43). 


A. Regularity for the Stokes system on bounded domains 


The following result is the basic ingredient in the proof of Proposition 6.2. 


Assume that 2 is a compact, connected Riemannian manifold, with smooth 
boundary, that 


we H(O,T*), fe L*(O,T*), pe LO), 
and that 


(A.1) —Au=f+dp, du=0, 0. 


Uog = 


We claim that u € H?(Q,7*). More generally, we claim that, for s > 0, 
(A.2) f € H°(0,T*) = we H**7(9,T*). 


Indeed, given any  € [0, co), it is an equivalent task to establish the implication 
(A.2) when we replace (A.1) by 


(A.3) (A—A)u=f+dp, du=0, 0. 


Uag = 


In this appendix we prove this result. We also treat the following related prob- 
lem. Assume v € H1(Q,T*), p € L?(Q), and 


(A.4) (A—A)v=dp, dv=0, Via, = 9. 
Then we claim that, for s > 0, 
(A.5) g € H**3/2(90, T*) => v € H*+?(0,T"). 


Here, for any x € £ (including x € OQ), T* = T*(Q) = T* M, where we take 
M to be a compact Riemannian manifold without boundary, containing 2 as an 
open subset (with smooth boundary OQ). In fact, take M to be diffeomorphic to 
the double of ©. 

We will represent solutions to (A.4) in terms of layer potentials, in a fash- 
ion parallel to constructions in § 11 of Chap.7. Such an approach is taken in 
[Soll]; see also [Lad]. A different sort of proof, appealing to the theory of sys- 


A. Regularity for the Stokes system on bounded domains 645 


tems elliptic in the sense of Douglis—Nirenberg, is given in [Tem]. An extension of 
the boundary-layer approach to Lipschitz domains is given in [FKV]. This work 
has been applied to the Navier-Stokes equations on Lipschitz domains in [DW]. 
Here the analysis was restricted to Lipschitz domains with connected boundary. 
This topological restriction was removed in [MiT]. Subsequently, [Mon] produced 
strong, short time solutions on 3D domains with arbitrarily rough boundary. 

Pick A € (0,00). We now define some operators on D’(/), so that 


(A.6) (A— A)® —dQ=IonD'(M,T*), 6®@=0. 
To get these operators, start with the Hodge decomposition on /: 
(A.7) d6G+6dG+P,=I onD'(M,A*), 


where FP, is the orthogonal projection onto the space of harmonic forms on M/, 
and G' is A~' on the orthogonal complement of .. Then (A.6) holds if we set 


tess © = (A— A)71(6dG + P,) € OPS~?(M), 

, Q=-dG€ OPS-1(M). 

Let F(x, y) and Q(x, y) denote the Schwartz kernels of these operators. Thus 
(A.9) (A—A,)F(a2,y) — dzQ(a,y) = dy(x)I, b2F (x,y) = 0. 


Note that as dist(x, y) — 0, we have (for dim Q = n > 3) 


F (x,y) ~ Ao(a, y) dist(x, y)* aot en, 


(A.10) 7 
Q(z, y) ad Bo(2,y) dist(a, y) ee ’ 


where Ao(Exp,v,y) and Bo(Exp,v,y) are homogeneous of degree zero in 
véT,M. 
We now look for solutions to (A.4) in the form of layer potentials: 
(a) = f F(e,y) wly) aS(y) = Fula), 
(A.11) oe 
p(x) = f Q(e.y)-w(y) aS{y) = Qu(a), 
0a 


The first two equations in (A.4) then follow directly from (A.9), and the last equa- 
tion in (A.4) is equivalent to 


(A.12) Uw = 49, 
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where 

(A.13) Wu(x) = f F(e,yw(y) aS(y), 09, 
e1@) 

defines 

(A.14) VW <¢ OPS~'(90,T"). 


Note that W is self-adjoint on L?(0Q, T*). The following lemma is incisive: 
Lemma A.1. The operator W is an elliptic operator in OP S~*(OQ). 
We can analyze the principal symbol of W using the results of § 11 in Chap. 7, 


particularly the identity (11.12) there. This implies that, for x € OO, € € 
T, (OQ), v the outgoing unit normal to OQ. at x, 


(A.15) ow(2,€) = Cy a oo(x,€ + Tv) dr. 


From (A.8), we have 


(A.16) oa(t, CB =|C\ tue Ac B, 6 BETIM. 


This is equal to |¢ mee 3, where Pe is the orthogonal projection of 7’* onto 
(¢)+. Thus 


(A.17) g(x, 0)8 = A(C)B — B(Q)B, 
with 


A(Q)8 = |01-78,  B(C)B = Ie] *(8- OC. 


Hence 
(A.18) / A(E+ Tv) ar = f (Ig? +77)~* dr =m |€\~*, 
with 
ie 
a _«1t+7? 
Also 


- =| cer 7) [(g- T7(B-v)v) dr 
(A.19) [Be+ms a= (le +7)" [(8-)€+72(8-v)v] d 
= ylé|“3(8- OE + yl€l-1(8 - vv, 
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with 

love) 1 [oe) 72 
A.20 = d — dr. 
oo ma [. (2 z (r+ 722 7 
We have 
(A.21) ow(2,€) = Cnlé|* [mI — 2Pe -— Pi], 


where Pz is the orthogonal projection of T> onto the span of €, and P,, is similarly 
defined. Note that y2 + 73 = 71, 0 < y;. Hence 0 < y2 < y and0 < ¥3 < 1. 
In fact, use of residue calculus readily gives 


TT 
Y=, Y2=7F> 3! 
4 4 


Thus the symbol (A.21) is invertible, in fact positive-definite. Lemma A.1 is 
proved. 
We also have, for any o € R, 


(A.22) W : H7(00,T*) —+ H°*1(00,T"*), Fredholm, of index zero. 


We next characterize Ker UY, which we claim is a one-dimensional subspace of 
C™ (dQ, T*). 

The ellipticity of Y implies that Ker W is a finite-dimensional subspace of 
C™(00,T*). If w € Ker W, consider v = Fw, p = Qu, defined by (A.11), on 
QUO (where O = M \ Q). We have (\ — A)v = dp on Q, 6v = 0 on Q, and 
v| sa 0, so, since solutions to (A.3) are unique for any > 0, we deduce that 
v = 0onQ. Similarly, v = 0 on O. In other words, 


(A.23) ®(wo) =0 on QUO, 

where o is the area element of 09, so wo is an element of D’(M,T™*), sup- 
ported on OQ. Since 6 € OPS~?(M), ®(wa) € C(M,T*), so (A.23) implies 
®(wo) = 0 on M. Consequently, by (A.6), 

(A.24) wo = dQ(wa) on M. 

The right side is equal to ddG(wo) = Pa(woa). It follows that d(wo) = 0, which 
uniquely determines w, up to a constant scalar multiple, on each component of 
OQ, namely as a constant multiple of v. It follows that 


(A.25) weée KerV =} wo=Cdyq, 


for some constant C, assuming 2 and O are connected. In our situation, O is 
diffeomorphic to 2, which is assumed to be connected. 
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Consequently, whenever g € H*+3/?(8Q, T*) satisfies 
(A.26) ‘Kc v) dS =0, 
0a. 


the unique solution to (A.4) is given by (A.11), with 

(A.27) w € H°t+1/2(90,T*). 

Note that if dv = 0 on Q and v| aq = 9, then the divergence theorem implies that 
(A.26) holds. Thus this construction applies to all solutions of (A.4). 


Next we reduce the analysis of (A.3) to that of (A.4). Thus, let f € H*(0,7%). 
Extend f to f € H*(M,T*). Nowlet u; € H**?(M,T*), p; € H*®*1(M) solve 


(A.28) (A\—A)uy =f+dp,, duj=0 onM, 


hence u, = ® f and p= Qf. If u solves (A.3), take v = u— uy 
(A.4), with p replaced by p — 1, and 


ae which solves 


(A.29) g =|. € H°*8/7(80, T*). 


Furthermore, since du; = 0 on M, we have (A.26), as remarked above. 
We are in a position to establish the results stated at the beginning of this 
appendix, namely: 


Proposition A.2. Assume u,v € H+(Q,T*), f € L°(0,T*), p € L7(Q), and 
r > 0. If 


(A.30) (A—A)u=ftdp, du=0, ti a =0, 
then, for s > 0, 

(A.31) fe H7(0,T*) > ue B77 (0,T"), 
and if 

(A.32) (A—A)v=dp, dv=0, te =49, 
then, for s > 0, 

(A.33) g € H°+3/2(00, T*) => v € H*+?(Q,T"). 


Proof. As seen above, it suffices to deduce (A.33) from (A.32), and we can 
assume g satisfies (A.26), so 
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(A.34) v(a) = / F(x,y)w(y) dS(y), «2, 


where F'(x, y) is the Schwartz kernel of the operator ® in (A.6)-(A.8), and 
(A.35) g € H8*3/2(80, T*) => w € H*t/2(80,T"). 


Now V = dv satisfies 


ce 
(A368 (a) = im oft F(x’, y)w(y) dS(y) =Gu(2), «2 € da, 


where, parallel to Proposition 11.3 of Chap. 7, we have 
(A.37) G € OPS* (an). 


Hence (A.35) implies Gw € H*+1/2(90, A?T*). Now standard estimates for the 
Dirichlet problem (A.36) yield V € H*+1(Q) if w € H*+1/2(AQ); hence, if v 
satisfies (A.32), 


(A.38) Av=6V € H°(Q), \|,,=9, 


and regularity for the Dirichlet problem yields the desired conclusion (A.33). Thus 
Proposition A.2 is proved. 
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Einstein’s Equations 


Introduction 


In this chapter we discuss Einstein’s gravitational equations, which state that the 
presence of matter and energy creates curvature in spacetime, via 


(0.1) Gjx = 87K T jx, 


where G'j;, = Ricj, — (1/2)Sg;, is the Einstein tensor, T;, is the stress-energy 
tensor due to the presence of matter, and « is a positive constant. In § | we intro- 
duce this equation and relate it to previous discussions of stress-energy tensors and 
their relation to equations of motion. We recall various stationary action principles 
that give rise to equations of motion and show that (0.1) itself results from adding 
a term proportional to the scalar curvature of spacetime to standard Lagrangians 
and considering variations of the metric tensor. 

In § 2 we consider spherically symmetric spacetimes and derive the solution 
to the empty-space Einstein equations due to Schwarzschild. This solution pro- 
vides a model for the gravitational field of a star. After some general comments 
on stationary and static spacetimes in § 3, we study in § 4 orbits of free particles in 
Schwarzschild spacetime. Comparison with orbits for the classical Kepler prob- 
lem enables us to relate the formula for a Schwarzschild metric to the mass of a 
star. 

In § 5 we consider the coupling of Einstein’s equations with Maxwell’s equa- 
tions for an electromagnetic field. In §6 we consider fluid motion and study a 
relativistic version of the Euler equations for fluids. We look at some steady solu- 
tions, and comparison with the Newtonian analogue leads to identification of the 
constant «& in (0.1) with the gravitational constant of Newtonian theory. In §7 
we consider some special cases of gravitational collapse, showing that in some 
cases no amount of fluid pressure can prevent such collapse, a phenomenon very 
different from that predicted by the classical theory. 

In 88 we consider the initial-value problem for Einstein’s equations, first in 
empty space. We discuss two ways of transforming the equations into hyper- 
bolic form: via the use of “harmonic coordinates” (following [CBr2]), and via a 
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modification of the equation due to [DeT]. We then consider Einstein’s equations 
in the presence of an electromagnetic field, and in the presence of matter, with 
emphasis on the initial-value problem for relativistic fluids. 

In §§89 and 10, we consider an alternative picture of the initial-value prob- 
lem for Einstein’s equations, regarding the initial data as specifying the first and 
second fundamental forms of a spacelike hypersurface (subject to constraints 
arising from the Gauss—Codazzi equations) and discussing the solution in terms 
of the evolution of such hypersurfaces (as “time slices”). Such a picture has 
been prominent in investigations by physicists for some time (see [MTW]) and 
has also played a significant role in modern mathematical work, such as in 
[CBR, CK, CBY2]. 


1. The gravitational field equations 


According to general relativity, the presence of matter in the universe (a four- 
dimensional spacetime) influences its Lorentz metric tensor, via the equation 


(1.1) GI* = 8rKTI*, 


where « is a positive (experimentally determined) constant, G/" is the Einstein 
tensor, defined in terms of the Ricci tensor by 


a an ae 
(1.2) GIF = Ricd* — 559°", 


S = Ric? ; being the scalar curvature, and T! k is the stress-energy tensor due to 
the presence of matter. 

We will review some facts about the stress-energy tensor, introduced in 
Chap. 2, and show how the stationary action principle—as used in § 11 of Chap. 2 
to produce Maxwell’s equations for an electromagnetic field, and the Lorentz 
force law for the influence of this field on charged matter, from a Lagrangian—can 
be extended to a variational principle that also leads to (1.1). This cannot be 
regarded as a derivation of (1.1), from more elementary physical principles, but it 
does provide a context for the equation. We follow the point of view of [Wey]. 

In the relativistic set-up, as mentioned in §18 of Chap. 1, one has a four- 
dimensional manifold M with a Lorentz metric (g;~), which we take to have 
signature (—,+,+,+). A particle with positive mass moves on a timelike curve 
in M, that is, one whose tangent Z satisfies (Z, Z) < 0. One parameterizes such 
a path by arc length, or “proper time,” so that (7, Z) = —1. The stress-energy 
tensor T’ due to some matter field on / is a symmetric tensor field of type (0, 2) 
with the property that an observer on such a path (with basically a Newtonian 
frame of mind) “observes” an energy density equal to T(Z, Z). 
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For example, consider a diffuse cloud of matter. We will model this as a 
continuous substance, whose motion is described by a vector field wu, satisfying 
(u,u) = —1. Suppose this substance has mass density 4 dV, measured by an 
observer whose velocity is u. Suppose this matter does not interact with itself; 
sometimes this is called a “dust.” Then an observer measures the mass-energy of 
the moving matter. The stress-energy tensor is given by 


n~ 


(1.3) T=pu®u, ie, T7* = pw, 


where T is the tensor field of type (2,0) obtained from T via the metric, that is, by 
raising indices. 

For the electromagnetic field F, an antisymmetric tensor field of type (0,2) on 
M, in (11.34) of Chap. 2 we produced the formula 


1 ; HD seisi fon 
(14) TH = — (Fire — gi FHF). 


Also in § 11 of Chap. 2 we considered the equations governing the interaction 
of the electromagnetic field on a Lorentz manifold with a charged dust cloud, 
modeled as a charged continuous substance. We produced the Lagrangian 


=. ut 


(1.5) L=—— 


1 
(FF) + (A, TF) + solu, u) = Li + La + Ls. 
Here, F, 4, and u are as above. Part of Maxwell’s equations assert 
(1.6) dF =0, 
so, at least locally, we can write F = dA fora 1-form A on M, called the electro- 
magnetic potential. The vector field 7 is the current, which has the form 7 = cu, 
where o dV is the charge density of the substance, measured by an observer with 
velocity u. We assumed there is only one type of matter present, so o is a constant 
multiple of jz. Also we assumed the law of conservation of mass: 


(1.7) div(juu) = 0. 


We then examined the action integral 
(1.8) Au) = [4 dV 


and showed that, for a compactly supported 1-form 3 on M, 


d 
(1.9) qllA+78,4)| <0 = | [-Z@arr)+ (3,J)| dV. 
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Thus the condition that J(.A,u) be stationary with respect to variations of A 
implies the remaining Maxwell equation: 


(1.10) d®*F =4r7’, 


where .7° is the 1-form obtained from 7 by lowering indices. A popular way to 
write (1.9) is as 


(1.11) ) pea [[-prhet 76, dV. 


Furthermore, we showed that when the motion of the charged substance was 
varied, leading to a variation u(7) of u, with w = 0,u, compactly supported, then 


(1.12) “1(Au(r))I, 9 = — [ (Wu -F.5,w) dV, 

or equivalently, for the variation of the motion of the charged matter, 

(1.13) sft dV = [lake —~FI,,T*| wy dv. 

Then the condition that [(A, wu) be stationary with respect to variations of u is that 


(1.14) uV,u—-FI =0, 


the Lorentz force law. 

Having varied A and wu in the action integral, we next vary the metric. We claim 
that the variation of an action integral of the form (1.8) with respect to the metric 
is given by 


(1.15) 6 pry 5 | Tain) dV, 


where T/" is the stress-energy tensor associated with the Lagrangian L. We look 
separately at the three terms in (1.5). First, we consider 


1 ; 
(1.16) L3 = 5h, u), TE" = puu*. 


To examine the variation in [ L3 dV, it is necessary to recognize that y: depends 
on the metric, via the identity 1 dV = m dy ds, where m is constant. Thus 
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1 jk 
6] Ll3dV=6 zginw'u m dy ds 
1 ok 
(1.17) =s J whu(oan)m dy ds 
1 ik 
= 5 f Huu (Sax) aV, 


yielding (1.15) in this case. 
Next, consider 


oil 


(1.18) Ly an 


: 1 . Tag: 
(Pas fi a (Fier = 79 F"Fu). 


Now (F, F) = (1/2) FjxFieg?g*, and 


(1.19) 5( Fix Fug’ g") = Fin Fie[(59")o™ + 9°"(59")], 
while 

1 
(1.20) dV = y/|g| dx => 5(dV) = — ayn 6g" AV. 
Hence 


i . 1 
(1.21) -ar6 f Ly dV = an | (6g) [Fi Fee Fa Fej— 590F Fe] dV. 


Using 699* = —g!"(5go;)g'* and Fj, = —F;,;, which implies F; = —F*';, we 
obtain 


(1.22) —8n 5 ft dv =-5 (ax) lorie = 59 F*F| aV, 


which also yields (1.15). 
For the middle term in (1.5), namely, Lz = (A,.7) = (e/m)(A, u) pu, we have 


(1.23) 5 fag) av=5 f <(Au)y dV =0, 
m 
consistent with the standard choice of stress-energy tensor for the coupled system: 


h . 1 . Tgp wy 
(1.24) TH = puluk + — (FFM gt FYE ,). 
An 4 
As noted in Chap. 2, § 11, if the stationary conditions (1.10) and (1.14) are satis- 
fied, and (1.6)—(1.7) hold, then this tensor has zero divergence (i.e., THs = 0). 
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Now Einstein hypothesized that “gravity” is a purely geometrical effect. 
Independently, both Einstein and Hilbert hypothesized that it could be captured 
by adding a fourth term, L4, to the Lagrangian (1.5). The term L4 should depend 
only on the metric tensor on M, not on the electromagnetic or matter fields (or 
any other field). It should be “natural.” The most natural scalar field to take is one 
proportional to the scalar curvature: 


(1.25) fas. 


where a is a real constant. 
We are hence led to calculate the variation of the integral of scalar curvature, 
with respect to the metric: 


Theorem 1.1. /f M is a manifold with nondegenerate metric tensor (g;,,), a8so- 
ciated Einstein tensor Gjx = Ricjn—(1/2)Sg;x, scalar curvature S, and volume 
element dV, then, with respect to a compactly supported variation of the metric, 
we have 


(1.26) 6 Js dV = [Gn dg)* dV = - [ a Ogjr aV. 


To establish this, we first obtain formulas for the variation of the Riemann 
curvature tensor, then of the Ricci tensor and the scalar curvature. Let Ij, be 
the connection coefficients. Then dI’;, is a tensor field. The formula (3.54) of 


Appendix C states that if R and R are the curvatures of the connections V and 
V=V+eC, then 


(1.27) (R— R)(X,Y)u = e(VxC)(Y,u) — e(VyC)(X, u) + e?[Cx, Cy]u. 
It follows that 

(1.28) OR jee = OL jek — OT jase. 

Contracting, we obtain 

(1.29) 6 Riese = OF jig — OV jhe. 

Another contraction yields 

(1.30) gi® 5 Ricjx = (g?* 50) — (g’* Ost) g 

since the metric tensor has vanishing covariant derivative. The identities 
(1.28)-(1.30) are called “Palatini identities.” 


Note that the right side of (1.30) is the divergence of a vector field. This will be 
significant for our calculation of (1.26). By the divergence theorem, it implies that 
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(1.31) [ore Ric;,) dV = 0, 


as long as 6g;% (hence 6 Ric;,) is compactly supported. 
We now compute the left side of (1.26). Since S$ = g/* Ric;,, we have 


(1.32) 569 = Ricjx 697" + g?* 6 Ricjr. 
Thus, since 5(dV) is given by (1.20), we have 
6(S dV) = Ricjz dg dV + g?*(6 Ricj,) dV + S 6(dV) 

(1.33) . 1 a jk : 
=> (Ric, = 5 5.95%) 59 dV+g (5 Ric;,) dV. 
The last term integrates to zero, by (1.31), so we have (1.26). 

For some purposes it is useful to consider analogues of (1.26), involving varia- 
tions of the metric which do not have compact support (see, e.g., [Yo4] and [Yo5]). 

Note that verifying (1.26) did not require the computation of 61°’ ;, in terms of 


69;x, though this can be done explicitly. Indeed, formula (3.63) of Appendix C 
implies 


1 
(1.34) OP ejk = 3 [Seis — 6geK.5 + Sain] : 


From Theorem 1.1 together with (1.15), we see that if L is a matter Lagrangian, 
such as in (1.5), then the stationary condition for 


1 
(1.35) 5 f (58+ 8rnL) ay 
with respect to variations of the metric tensor, yields the gravitational equation 
(1.1). 


An alternative formulation of (1.1) is the following. Take the trace of both sides 
of (1.1). We have 


; » tae 
(1.36) Gi, = Rio!; — 5S9/; = (1 = =)s 
when n = dim M. Since n = 4 here, this implies 

(1.37) —S=8nKT, T=T;. 


Then substitution of —87«7 for S in (1.1) yields 


a 
(1.38) Ric = 8am (TH — srg). 
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We now derive a geometrical interpretation of the Einstein tensor. Let € = eg 
be any unit timelike vector in T,,M, part of an orthonormal basis {e€o, €1, €2, €3} 
of T,M, where (e;,e;) = +1 for j > 1, —1 for j = 0. From the definiton of the 
Ricci tensor, we obtain 


3 


3 
(1.39) Ric(é,é) = 5_(R(e;,6)£,e7) = — )_ K(ENe;), 


jel j=l 


where I(€ A e;) denotes the sectional curvature with respect to the 2-plane in 
T,M spanned by € and e;. Compare with Proposition 4.7 of Appendix C, but 
note the sign change due to the different signature of the metric here. Also, the 
scalar curvature of / at p is given by 


(1.40) S= > KlejAen.) (0<5,k <3). 
JAK 
Hence, for G = Ric — (1/2)Sg, we have 
1 
(1.41) G(E,€) = Rie(E,E) + 55= DY K(ej Nex). 


1<j<k 


Now let Ve be the spacelike hypersurface, formed by the geodesics through p 
normal to €. The second fundamental form of V¢ vanishes at p (see the proof of 
Proposition 4.7 in Appendix C, analyzing the sectional curvature). It follows that 
the scalar curvature of V¢ at p is given by 


(1.42) S(Ve)=2 S> K(e; Aen), 
1<j<k 


with K(e; A ex) as in (1.41). Hence 


(1.43) S(Ve) = 2G(E, £). 


Thus the gravitational equation (1.1) can be written as 


(1.44) S(Ve) = l6rKp, 
where 
(1.45) p=T(é,€) 


is the energy density, measured by an observer with 4-velocity €. Note that if T’ 
is given by (1.3), then T(€, €) = pu(u, €)”, which is nonnegative (in fact, positive 
where pp # 0). Also, if T is the stress-energy tensor (1.4) of the electromagnetic 
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field, then, as observed in calculations leading to (11.33) in Chap. 2, T(E, €) is the 
observed measurement of (1/87) (|E|? + |B|*), also nonnegative (and positive 
where the electromagnetic field does not vanish). Typically, a stress-energy tensor 
for ordinary matter has the property that 


(1.46) T(€,€) >0, for € timelike. 


If p > 0 at p, we see that S(V¢-) has the same sign as the constant k. Below we 
will argue that « is positive. 

Note that the equation (1.14) indicates that uncharged matter should move 
along geodesics. Let us consider the influence of the geometry on the rela- 
tive motion of nearby neutral particles, whose motion is along nearby timelike 
geodesics in M. Say one geodesic yo(s) has unit timelike tangent vector € = 
yo(s); (€,€) = —1. If there is a one-parameter family of geodesics y,(s), then 
W(s) = 0;77(s) = is a vector field along ‘yo that satisfies the Jacobi equation 


(1.47) VeVeW = R(E, WIE. 


See Exercise 10 in § 3 of Appendix C. 

Let us vary the geodesic yo in the following specific fashion. Let Vz be 
the hypersurface described above, spanned by geodesics through p = (0) 
with tangent vector in (€)+ C T,M. We extend € over Ve by radial paral- 
lel transport and, given q € Vé, close to p, consider the geodesic y satisfying 
(0) = q, 7(0) = € (see Fig. 1.1). If 7, is a one-parameter family of such 
geodesics, with W(s) = ar 77(s)| then 


T=0’ 


(1.48) W(0)L€€7,M and VeW(0) =0. 


Vx 


FIGURE 1.1 Nearby Timelike Geodesics 
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Note that (d/ds)(W,&€) = (VeW, &) and 


d2 


53 (W.8) = (V2W, ) = (RIE, W)E,6) = 0. 


Hence, if (1.48) holds, then W(s) L &(s) all along 7. 
Now we have (d/ds)(W, W) = 2(VeW, W), and 


e d 
(1.49) qe W) = 27,\VeW, W)= 2(VeW, W) + 2(VeW, VeW). 
Hence, at p, 
a 
(1.50) 3 (W,W) = —2(R(W, £)£, W). 


ds 


If we let W; be an orthonormal basis of T, (Ve), we obtain, via (1.39), 
RB 
' 1 
(1.51) 1 2 (W;, W;) = -2 Ric(€, €) = —16mK Te) +57], 


at p. 
Note that if T is given by (1.3), then 


(1.52) T(E, 6) + 57 = wlEw)? + 54a u) 2 0, 


when € and wu are both unit timelike vectors. In particular, T(u, u)+7/2 = y/2, in 
this case. Generally, a stress-energy tensor T is said to satisfy the “strong energy 
condition” if T(€,€) + 7/2 > 0 for all unit timelike €. For such stress-energy 
tensors, we have (at p) 


a 
iw) 


3 
a D(Wj,W;) <0 if x >0, 
(1.53) jt 
=0 if k=0, 
>0 ifk <0. 


Now, it is clear that an attractive gravitational force would make (1.53) < 0 (and 
in fact < 0 in a nontrivial matter field), while a repulsive force would make (1.53) 
> 0. Since we observe gravity to be attractive, we conclude that the constant « 
in (1.1) is > 0. Further discussion of the determination of « will be given in § 6, 
after (6.73)—(6.74). 
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Exercises 


1. If M is a Riemannian manifold of dimension 2, show that Gj, = 0. Deduce from 
Theorem |.1 that 


(1.54) [« dA = C(M) 
M 


is independent of the metric on /. Relate this to the Gauss—Bonnet formula established 
in § 5 of Appendix C, on connections and curvature. 
2. As shown in (3.31) of Appendix C, the Einstein tensor satisfies 


(1.55) Gi* . =0, 


as a consequence of the Bianchi identity. Hence, Einstein’s equations (1.1) imply 


(1.56) T* , =0. 
Compute this when T!* is given by (1.24). Compare with the calculation (11.54) of 
Chap. 2. 

3. A fluid, with 4-velocity field wu (satisfying (u,u) = —1), density p, and pressure p 


(measured by an observer with velocity w), has stress-energy tensor 
(1.57) Tyr = (p+ p)ujur + pgjr- 


Thus a dust, with the stress-energy tensor (1.3), is a zero-pressure fluid. Compute T?* ‘k 
in this case, and show that the conservation law (1.7) and the geodesic equation Vu = 
0 (which is (1.14) in the absence of an electromagnetic field) are modified to 


(1.58) div(pu) = —pdivu, (p+ p)Vuu = —II(u) grad p, 


where II(u) denotes projection orthogonal to u (with respect to the Lorentz metric), 
namely, in components, (II(w) grad p)? = p,4(g?* + uu*). 

4. Recall from (1.37) that when (1.1) holds, the scalar curvature of spacetime is given by 
S = —8nKT” ;. Deduce that if T* is given by (1.24), then 


(1.59) S = 81K, 
while if it is given by (1.57), then 
(1.60) S = 8rK(p — 3p). 


5. Show that if T9* is given by (1.3), then 
(1.61) Ric(u, u) = 47K, 


and if it is given by (1.24), then 


(1.62) Ric(u, u) = 4K + «(|E|? +|BI?), 


where F and B are the electric and magnetic fields, measured by an observer with 
4-velocity u, while if it is given by (1.57), then 


(1.63) Ric(u, u) = 4rK(p + 3p). 
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2. Spherically symmetric spacetimes and the Schwarzschild 
solution 


We investigate solutions to Einstein’s equations (1.1) which are spherically sym- 
metric. Generally, a Lorentz 4-manifold (/, g) is said to be spherically symmetric 
provided there is an effective action of SO(3) as a group of isometries of 1/7. The 
generic orbit O will be diffeomorphic to S*. We will assume that © is spacelike, 
that is, the metric induced on Q is positive-definite. Given p € O, let K, be the 
subgroup of SO(3) fixing p; K, is a circle group. Thus K,, acts as a group of 
rotations on 77,0, and it also acts on N,O = TO Since N,,O has a metric of 
signature (—1,1), and K,, acts on it as a compact, connected group of isometries, 
it follows that Kx, acts trivially on N,,O. 

On a neighborhood of O Cc M, diffeomorphic to (a, b) x (c,d) x $7, we can 
introduce coordinates so that the metric is 


(2.1) ds* = —C(r,t) dt? + D(r,t) dr? + 2E(r,t) dr dt + F(r,t) dw?, 


Where the functions C,D,£, and F are smooth and positive, and dw? is the 
standard Riemannian metric on the unit sphere, S? C R°. Assume that OF /Or # 
0 on O. Then we can change variables, replacing r by r’ = \/F'(r, t), and get the 
simpler form 


(2.2) ds? = —C(r,t) dt? + D(r, t) dr? + 2E(r, t) dr dt + r? du, 
with new functions Cr, t), and so on. Next, we can replace t by ¢’, such that 
(2.3) dt’ = n(r,t)|C(r, t) dt — E(r,t) dr], 


where 7)(r,t) is an integrating factor, chosen to be positive and to make the right 
side of (2.3) a closed form. Then the metric on / takes the form 


(2.4) ds? = —e"(™*) de? 4 eM) dp? + 4? du?. 


We take spherical coordinates (yp, 4) on S?, where y = 0 defines the north pole 
and » = 7/2 defines the equator. (Physics texts often give y and 6 the opposite 
roles.) Then 

dw? = dy” + sin? y dé”. 


The formula for the Einstein tensor G';, for such a metric is fairly complicated. 
Rather than just write it down, we will take a leisurely path through the calcu- 
lation, making some general observations about the Einstein tensor, and other 
measures of curvature, along the way. Some of these calculations will have fur- 
ther uses in subsequent sections. Among alternative derivations of the formula for 
the Ricci tensor for a metric of the form (2.4), we mention one using differential 
forms, on pp. 87-90 of [HT]. 
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The metric (2.4) has the general form 


(2.5) Gk = Ge +VGie, WEC(U), 


on a product M = U x S, where g” is the metric tensor of a manifold U, g° is 
the metric tensor of 5. To be more precise, if 


(a a") = (xo, ste te Fp Bes as ,LL4+M-1) EUxS, 


on is the metric tensor for U if 0 < 7,k < DL —1, and we fill in I to be zero 
for other indices. Similarly, we set 5% = hj+z.r+x for0 < j,k < M —1, where 
hjx is the metric tensor for S, and we fill in Fr to be zero for other indices. In the 
example (2.4), we have U C R?, S = $?,s0 L = M = 2. With obvious notation, 


jk jk -1 jk 

(2.6) f° =g7 +095 - 
We want to express the curvature tensor Ri gem of M in terms of the tensors 
URI pom and S Ri kem and the function wy and then obtain formulas for the Ricci 


tensor, scalar curvature, and Einstein tensor of 1. Recall that if I'%,, are the 
connection coefficients on //, then 


(2.7) R? itm = OV tem — Om pe +19 vel’ em — To ml’ re, 


where we use the summation convention (sum on v’). Meanwhile, 


1 
(2.8) Dein = 59°" [Dkgjn + OjGkn — Angin]: 


Using (2.5) and (2.6), we can first express i jk in terms of the connection coeffi- 
cients on the factors U and S: 


(2.9) Ti, = OT 5, + OT ia + Boje, 
where 

1 
Bein = 59° [G5 OV + Ge GY — Gx nV] 


1 
= [o*; On9 + 0°, O;9 — G5 ay). 


(2.10) 


Here we have set 


(2.11) v= logy, 
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and, in a product coordinate system such as described above, a j —lifj = Cis 
an index for S, 0 otherwise. We can write 


(2.12) of, = e(£)5*;, 

where e(f) = lif @ > L, e(€) = Oif 2 < L. Note that o°; produces a well- 
defined tensor field of type (1,1) on M =U x S, namely T, M = T,/U @ Ty, S, 
and o(2) is the projection of TM onto T,S, annihilating TU. 

Now, given p = (p’,p”) € M = U x S, let us use product-exponential 
coordinates centered at p, namely, the product of an exponential coordinate system 
on U centered at p’ and an exponential coordinate system on S' centered at p”. 
In such a coordinate system, we have 
(2.13) R? gem = 7 Ro xem + Reem + D gem, 
with 


(2.14) D9 gm = 0¢B? km — OmBi xe + Bi veBY em — BoimB" ke, at p. 


The formula (2.10) for B¢ jk is valid in any coordinate system. In product- 
exponential coordinates we have 


2.15) OB bm = 5 [05 B¢0m 9 + 0m B:0K9 ~ Gp, 00", 
at p. Also, 
On Bl ne = 5 [07 nO + 050 Oded — gf On O], 


at p, so 


OpB? pm — Om BY ke 
(2.16) 1. . . 
= 5 Fin O09 — 05 ¢ OmOKd — Gem OO W + G20 Om YI, 


at p. From (2.10) we have 


4 BIB’ km = of, OpD Om + Os mn Oed Od 


2.17 ; ; : 

, ~~ Oe Oy Omd _ Din Oy On) _ oa ¢(dv, dW) ems 
where 

(2.18) (dv, dw) = 0,0 O"w. 


Antisymmetrizing (2.17) with respect to and m, we have 
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ACB 6B” tye oS Be ii” ye) = (om Opd a oe Im) Op 
(2.19) + (Gem Oe9 — Gee Om) Ob 


+ (oF make — 07 Gem) (dd, deb). 
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We can produce a formula for D4 em Where each term is manifestly a tensor. 
To get this, first note that the tensor whose components in a product-exponential 
coordinate system are Op, is 0.e.~ — (1/2) (dw, dd) g5,. In fact, in any product 


coordinate system, 
Vek = OOn9 —T” cy OvV 
(2.20) = 0¢0¢0 — °T" ez 8,0 — BY ox 0,8 
1 
= DOK) + 5 (ah, dd) 9%, — UT’ ex Ov. 


The last identity follows from (2.10) plus the fact that we are summing only over 


vy < L. Then, from (2.16) and (2.19) we obtain 


— 


OP pigs = 2 [eo mda = OF 8 srr — Gem” e a ake sm] 
(2.21) 


in any product coordinate system. 
Contracting (2.13), we have 


(2.22) Rickm = Ric! + Rice, + Fem: Fhm = Diijm: 
Thus, in product-exponential coordinates centered at p € M, we have 
(2.23) Pym = 0; BY em = Om BI kj + BiB tem _ DP iB tas 


at p. We evaluate this more explicitly, using (2.16) and (2.19). 
Contracting (2.16) over 7 = £, we have 


OB On Beg 
1. . : ; : 
(2.24) =5 [09m O; KD — 09; Om OK9 — GFom FOV + GR; Om? VY] 
1 1 
= —=M dmO,8 — ~ 9% 
2 0 On 5 IkmEY, 


at p, where M = dim S and Ly = 0,077). Since y € C~(U), we have 


(2.25) Ly = SY) 0% = GG” O.Omb = Ou, 


jSL-1 


Lif2 : - - 
+ A [om nse = 07 (9.9m + Jem 9:0 = Geb Vom , 
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at p. Note that Ll is the Laplace operator on U. (For our case of primary interest, 
U has a Lorentz metric, so we use Hy rather than Ay.) Contracting (2.19) over 
j = £, we have 


BP Bite _ i ay 
(2.26) 1 


= — FM (On9)(Oe0) 7(M — 2)(d9, db) gem: 


Thus we obtain, at p, 


1 1 1 
Fam = — 5MOmdK9 — 5 (Qu) gem — GM (Im9)(Ie9) 


— 1(M — 2)(d8, db) gp. 


(2.27) 


_ 


To write this in tensor form, we recall the computation (2.20). Hence, in any 
product coordinate system, 


1 1 1 
(2.28) Fam = —5M(Y:k:m + 59:93) — 5 (Gud — (a8, dd) gm: 


The scalar curvature of M is S = g*™ Ricpm. By (2.22), we have 
(2.29) S=Sy+'Sst+8, B= 9"Fam: 


The formula (2.27) yields 


1 1 
B= — 5 Magi" OmOx9 — =Mb- "Oud 


(2.30) 


1 (M —2)My-1(ad, a), 


1 m 
— 5 Mal?" (Om) (Ox) — 5 


at p. Note that 


(2.31) gf” Om) = G5" Om (bOnp) = oOo — v7 98" (Om) (On), 


at p. Hence 


02.32) B= —My Oy — [(M? — 3M)v-? (dy), dv). 


As a check on this calculation, consider the simple case dim U = dim S = 1, 
with 


(2.33) g’ =da2, gi =dxt, g=dx2+ (20) de?. 
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Of course, Sy = Sig = 0 here, and (2.29) and (2.32) yield for the scalar curvature 
of M, 


(2.34) $= (00) (a0) + 520) (0)? 


Since S = 2K, K being the Gauss curvature of the two-dimensional surface M/, 
this formula agrees with the F = 1, G = wy case of formula (3.37) in Appendix 
C, for the Gauss curvature of a surface with metric E du? + G dv?. 

We now look at the Einstein tensor of M, Gj, = Ric; — (1/2)Sg;,,. In view 
of (2.22) and (2.29), we have 


1 
5 Su (aje + Y95x) 


1 1 
— 50 'Ss (G5, + Yo;4) — 5 Bain: 


Gyp = Rich, + Rich, + Fj — 
(2.35) 


Rearranging terms, we can write 
U s _1 S -1,U 1 
(2.36) Gyn = Gy, + Gh, - 3 Subgin +85 07.) + Fe 5 P9ik- 


Before considering the case dim S = 2, let us first consider the case dim U = 
dim S = 1. Then U and S are flat, and (2.36) becomes Gj, = Fyx — (1/2) 8g;x- 
Let us parameterize U and S by arc length. The case M = 1 of (2.27) is 


1 
2 


1 


(Gu) 9%, — 5 (89) (O0) + FU", dd) a6. 


Here, Oy = wW" (x0), 0j;0,0 = V0" (xo) for 7 = k = 0, 0 otherwise, and 
0,0 = W' for 7 = 0, 0 otherwise. Also, in this case (2.32) (or (2.34)) implies 
B= —y*y" + (1/2)p-?(u")?. It readily follows that Fj, = (1/2)89;x, hence 
Gx = 0. This is part of a more general result. 


Lemma 2.1. [f M is a two-dimensional manifold with a nondegenerate metric 
tensor, then its Einstein tensor always vanishes, Gj; = 0. 


Proof. Generally, Ric*,, is produced from RJ*,,,, via a natural map 

(2.37) K: End(A?T,, M) — End(T,M). 

Since 59* jn = (n — 1)6*,, we see that K(I) = (n —1)I, whenn = dim M. 
Now, if dim M = 2, then A?T,M is one-dimensional. Hence Ric” m must be a 


scalar multiple of b¥ 4» SO RiChn, must be a scalar multiple of g;,,. Comparing 
traces, we see that the multiple must be S/2, so 


1 
Ric;, = 9 °9ik when dim M = 2. 
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This precisely says that G;, = 0. Compare the derivation of (3.35) in 
Appendix C. 


Let us now consider the case dim S = 2. From (2.28), we have 


1 1 1 
2.38) Fy = —O.jn — 5(Cub)ghe + 5 (dd, dd) 95, — 59.79% 


in any product coordinate system. Contracting this, or alternatively taking the 
M = 2 case of (2.32), we have 


(2.39) B= —2) Cow + 502d, a). 


When dim S = 2, we have from Lemma 2.1 that G, = 0. 
If also dim U = 2, then Go. = 0, so (2.36) yields 


1 1 
(2.40) Gir = —5 (Suv oy, + Sob" 95x) + Fin — 58 9ins 


with Fj; and ( given by (2.38) and (2.39). 
Now, whenever a two-dimensional surface U has a metric tensor of the form 


(2.41) yo dae + 1 dai, 


with y; = 7;(20,21), we readily obtain from (2.8) the formulas for the connec- 
tion coefficients: 


Cr 1) 2 Ga O10 ) (ex .) os? ee i 
270 \O1¥0 —O071/) ’ - 27 \ On AN 

In the case (2.4), we have 

(2.42) y=-e", =e, 


where v and are functions of (uo, u1). This yields 


Cr ) _ 1 (av OV 
jk 2\Ov er ~"AA)’ 
1 fe’>d0\v AX 
upi_,\ + 1 0 
("Tie) =5 ( dp : 


Also, in the case (2.4), 2 = wW(r) = r?, where r = 21, and 0 = 2logr. 
Consequently, by (2.20), 


(2.43) 
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1 -1 S 
(2.44) 05:4 = OjOK0 + 5% (dy), dW) 95x — Wyk, 


where 0,0,0 = —2/r?, 0;0;0 = 0 for other indices, and 


il ea-vayy Oor 


for0 < j,k <1, w jx = 0 for other indices. 

Hence for the metric tensor (2.4), the 4 x 4 matrix (G,,) splits into two 2 x 2 
blocks: 
(2.46) Gin = Gin + Gir: 


The upper left block is 


~ 1 1 1 
CRE —zSs¥' 95k 595k OjOK9 + wie — 5(OjP) (nd) 


(2.47) ; 1 
= — 501 (Ss— Wed + so aah, dv)) of — wie, 


since, for 9 = 2logr, we have 0;0,0 + (1/2)(0;V)(Ox0) = 0. The lower right 
block is 


5 ( uv) 95 


= 5 (Suv uw 4 =u Naw, dv)) of. 


ny 


1 1 
Gir = — 5 Sub 9; = 50095 = 


(2.48) 


Thus, for metrics of the form (2.4), Gj, has 6 nonzero components, out of 16 
(or, if symmetry is taken into account in counting components, it has 5 nonzero 
components, out of 10). 

When the metric tensor of U has the form —e” dt? + e* dr?, the calculation of 
Gauss curvature in (3.37) of Appendix C gives 


Sy =e OH? [8, (vp?) + Be“ )/?)] 
(2.49) 


1 1 
=—¢ “y+ Ag— gurlur =e a relrt —1nH)e”. 


Here, \,, = OA, Ax = OA/Ot, and so forth. Of course, the unit sphere S? has 


Gauss curvature 1, so Sg = 2. Also, we have, for w(r) = r?, 


(2.50) uy = 2e > +r(v, — dre, w (dw, dw) = 4e>. 


The formulas (2.45), (2.49), and (2.50) specify all the ingredients in (2.47) and 
(2.48). 
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We conclude that, for a metric of the form (2.4), with (%0,71,2%2,%3) = 
(t,1r, y, 9), all the nontrivial components of G are specified by the following five 
formulas: 


evr ‘ 

(2.51) Gyo 3 (1—rA, —e*), 

rv 
(2.52) Goi = Gio = =, 

1 
(2.53) Gu= a (1 +rv,—e), 
il 2d 1, 1 1 

(2.54) Goo = 5° e€ (ver 5 Yr + ~ (Ur Ar) s¥rAr) 

_ 1 

=re (ru + 5 — 5): 

(2.55) G33 = sin? 2) Goo. 


Having determined the Einstein tensor for a spherically symmetric spacetime 
that has been put in the form (2.4), we now examine when the empty-space Ein- 
stein equation is satisfied, namely, when Gj, = 0. If we require all components 
Gj, to vanish, then (2.52) implies 0A/Ot = 0, or A = A(r). Furthermore, (2.51) 
and (2.53) imply 


(2.56) d,(\ +v) =0, 


or A(r) + (r,t) = f(t). Now, replacing t by t’ = y(t) has the effect of adding 
an arbitrary function of ¢ to v in (2.4), so we can arrange that v + \ = 0. Thus the 
metric (2.4) takes the form 


(2.57) ds? = —e”"") dt? + ec’ dr? + 7? dw?. 


Note that the coefficients are independent of t! We say the metric is static. The 
observation that a spherically symmetric solution to G = 0 must be static (under 
the additional hypotheses made at the beginning of this section) is known as 
Birkhoff’s theorem. 

For the metric (2.57), the component G'‘o,, given by (2.52), certainly vanishes, 
and Goo = 0 = Gy, if and only if 


(2.58) rv (r) =e") — 1, 


If we set w(r) = e”"), this ODE becomes rw (r) = 1—w(r), anonhomogeneous 
Euler equation with general solution w(r) = 1 — K/r. Hence 

Kk 
(2.59) v=, 


r 
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It remains to check that G22 vanishes for this metric, that is, that 
i / 2 2 / 
(2.60) v'(r) + u(r)? + -'(r) =0. 
Tr 


This is straightforward to check. Rather than substituting v(7), given by (2.59), 
into (2.60), we can differentiate (2.58) to get rv’ +-v' = —v'e~”; adding r(v’)? + 
v’ to both sides and again using (2.58), we obtain (2.60). 

We have derived the following metric, known as the Schwarzschild metric, 
satisfying the vacuum Einstein equation G = 0: 


K 
ie 
r 


Ky\-1 
(2.61) ds? =—(1- =) dP? + (1- =) dr? +1? dw”. 

r 
We can readily check that this is not a flat metric in a funny coordinate system, 
unless ky = 0. Indeed, by (2.13) we have R101 = URL 01, since D°j9, = 0 by 
(2.21). Now ¥ R°4 4, is a nonzero multiple of Sz, and, by (2.49), we have 


in this case. 

We have a solution to G = 0 upon taking any real x in (2.61), but the metrics 
most relevant to observed phenomena are those for which Kk > 0. Indeed, as will 
be seen in § 4, geodesic orbits for the metric (2.61) have the property that, for large 
r and small “velocity,” they approximate orbits for the Newtonian problem 


(2.62) & = —grad V(x), V(a) = a 
2 |x| 

If ky > 0, these are orbits for the Kepler problem, that is, for the two-body 

planetary motion problem. If K < 0, these are orbits for the Coulomb problem, 

for the motion of charged particles with like charges, hence for motion under a 

repulsive force. Repulsive gravitational fields have not been observed. 

Note that if we take K > 0 in (2.61), then the formula is degenerate atr = K. 
Only on {r > K} is 0/0t a timelike vector. It is this region that is properly said 
to carry the Schwarzschild metric. 

It follows from the fact that the sectional curvature of the plane spanned by 
O/Ot and 0/Or is Sy that the Schwarzschild metric (2.61) is singular at r = 0. 
On the other hand, the apparent singularity in the Schwarzschild metric at r = K 
actually arises from a coordinate singularity, which can be removed as follows. 
First, set 


K —1 
(2.63) v=t+ f (1) dr =t+r+Klog(r—K). 


Using coordinates (v,r, 0, y), the metric tensor (2.61) takes the form 
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FIGURE 2.1 Extended Schwarzschild Metric 


K 
(2.64) ds? = (1 ) dv? +2 du dr +1? du. 
r 


These coordinates are called Eddington—Finkelstein coordinates. The region 
{r > K} in (t,r,6, y)-coordinates corresponds to the region {r > K} in the 
new coordinate system, but the metric (2.64) is smooth and nondegenerate on the 
larger region {r > 0}. Note that if y, 6, and v are held constant and r \, K, then 
t Aco. 

The shell © = {r = K} is a null surface for the metric (2.64); that is, the 
restriction of the metric to & is everywhere degenerate. Thus, for each p € &, 
the light cone formed by null geodesics through p is tangent to © at p. Figure 2.1 
depicts the extended Schwarzschild metric, in Eddington—Finkelstein coordinates. 

The function v arises from considering null geodesics in the Schwarzschild 
spacetime for which w € S? is constant or, equivalently, considering null 
geodesics in the two-dimensional spacetime 


(2.65) ds? = (1 *) at? + (1 A) ar. 


On the region r > K, there are a family of null geodesics given by v = const. and 
a family of null geodesics given by u = const., where 


(2.66) u=t—r—K lg(r—K). 
The coordinates (u,7,9,y) are called outgoing Eddington—Finkelstein coordi- 


nates (the ones above then being called incoming), and in this coordinate system 
the Schwarzschild metric takes the form 
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K 
(2.67) ds? = (1 ~) du 2 du dr +r? dw. 


As above, the region {r > A} in (t, r, 0, y)-coordinates corresponds to the region 
{r > K} in the new coordinate system. 

The incoming and outgoing Eddington—Finkelstein coordinates yield two dif- 
ferent extensions of Schwarzschild spacetime. These two extensions were sewn 
together by M. Kruskal and P. Szekeres. As an intermediate step from (2.64) and 
(2.67) to “Kruskal coordinates,” use the coordinates (u, v, 9, y). In this coordinate 
system, the Schwarzschild metric becomes 


K 
(2.68) ds? = -(1 - =) du dv +1? dw?, 


where r is determined by 


(2.69) s@-wartK log(r — K). 


Now make a further coordinate change: 


(2.70) g= 5 ("2 = aes. os (e?/? r oe, 


Then, in the Kruskal coordinates (£,7, 6, y), the metric becomes 
(2.71) ds* = F(r,£)?(—dr? + dé?) + r(E,7)? dw?, 


where r is determined by 


(2.72) Pa = —(r—Ky)e"/*, 
and F’ is given by 
2 4K? —r/K 
(2.73) F@,éy =e , 
r 


Figure 2.2 depicts the extended Schwarzschild spacetime in Kruskal coordinates. 


Exercises 


1. Use Lemma 2.1 together with Theorem 1.1 to show that whenever M is a compact 
manifold of dimension 2, endowed with a Riemannian metric, the integrated scalar 


curvature 
i SdA 
M 
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ao r=0 we 


t= const \> L- 
| oe QA 


ceca r=0 mig, 


FIGURE 2.2 Kruskal Coordinates 


is independent of the choice of Riemannian metric on M/. How does this fit in with 
proofs of the Gauss—Bonnet theorem, given in § 5 of Appendix C? 
2. Suppose M is a manifold with a nondegenerate metric tensor. Show that 


dim M = 3, Ricj, = 0 => R'jxe = 0. 


(Hint: Show that the map (2.37) is an isomorphism when dim M = 3.) 
3. Suppose in (2.4) you replace S”, with its standard metric, by hyperbolic space H’, with 
Gauss curvature —1, obtaining 


(2.74) ds” = —e" dt? + & dr? +r? 9. 


Show that in the formulas (2.46)—(2.48) for the Einstein tensor, the only change occurs 
in (2.47), where Ss = 2 is replaced by —2. Show that this has the effect precisely of 
replacing —e* by +e* in the formulas (2.51) and (2.53) for Goo and G1. Deduce that 


a solution to the vacuum Einstein equations arises if \ = —v and 
K 
e’ =-1—-—, 
r 


for some K € R. Taking K > 0, we have a metric of the form 


=i 
(2.75) ds? (1 ! *) dt? (1 *) dr? +9 Qu, 
Tr r 


so r, rather than ¢, takes the place of “time,” and the Killing vector 0/0t is not timelike. 
Taking K = —k, & > 0, we have a metric of the form 


(2.76) ds? = (* 1) a+ (= 1)" dr? + on (r # k), 


r 
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and the Killing vector 0/0¢ is timelike on {r < kK}. 
Show that 


(2.77) —dr? + 77 gr 


can be interpreted as the flat Minkowski metric on the interior of the forward light cone 
in R®. What does this mean for (2.75)? 


3. Stationary and static spacetimes 


Let M be a four-dimensional manifold with a Lorentz metric, of signature 
(—1,1,1,1). We say M is stationary if there is a timelike Killing field Z on M, 
generating a one-parameter group of isometries. We then have a fibration M > S, 
where S is a three-dimensional manifold and the fibers are the integral curves of 
Z, and S inherits a natural Riemannian metric. We call S' the “space” associated 
to the spacetime //. 

Given x € M, let V, denote the subspace of T’, M/ consisting of vectors parallel 
to Z(x), and let H,, denote the orthogonal complement of V,,, with respect to the 
Lorentz metric on M. We then have complementary bundles V and H. Indeed, 
p : M — S has the structure of a principal G-bundle with connection, with 
G = R. For each x, Hz is naturally isomorphic to T),(.)S. The curvature of this 
bundle is the V-valued 2-form w given by 


whenever X and Y are smooth sections of H. Here, Po is the orthogonal 
projection of T;,M onto VY. Since G = R, this gives rise in a natural fashion 
to an ordinary 2-form on M. 

We remark that the integral curves of Z are all geodesics if and only if the 
length of Z is constant on M. This is a restrictive condition, which we certainly 
will not assume to hold. Thus such an integral curve C can have a nonvanishing 
second fundamental form II¢(X,Y), which for X,Y € V, takes values in H,. 
We have the following quantitative statement: 


Proposition 3.1. If Z is a Killing field and U, is a smooth section of H, then 
(3.2) (IIe(Z, Z), U1) = — Suz, Z). 

Proof. The left side of (3.2) is equal to 

(3.3) (VzZ,U1) = —(Z,VzU1) = —(Z, Vu, Z — £201). 


Now (Z,£2U1) = —(£Lzg)(Z, U1) = 0, so the right side of (3.3) is equal to the 
right side of (3.2), and the proof is complete. 
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Let Eo and FE; denote the bundles VY and H, respectively, so TM = Ep @ Fj. 
Let P;(x) denote the orthogonal projection of T,M onto E;,. Thus Po is as in 
(3.1). If V is the Levi—Civita connection on M, we define another metric connec- 
tion (with torsion) 
(3.4) V=V° ev}, 
where we = P;V x on sections of £;. Thus 


(3.5) Vx =PVxh+AVxPi = Vx -Cx, 


where C'x has the form 


0. Ti, 
(3.6) Ox= (70 7) 


as in (4.40) of Appendix C; C is a section of Hom(T’M © TM,T™M). Let us set 
(3.7) Tx =—-Cprx, Ax =—Cp,x. 

The Weingarten formula states that 

(3.8) Cx = —Cx; 


see (4.41) of Appendix C. Note that if x € C, an integral curve of Z, then 


(3.9) X,Y € Vy => CxY = IIe(X,Y). 


The following is a special case of a result of B. O’Neill, [ON]. It says that A in 
(3.7) measures the extent to which H is not integrable. 


Proposition 3.2. If X and Y are sections of H, then 
1 
(3.10) CxY = —5PolX, Y]. 


Proof. Since C is clearly a tensor, it suffices to prove this when X and Y are 
“basic,” namely, when they, arise from vector fields on /. Note that 


Po[X,Y] = PoVxY — PoVyX = AxY — AyX, 
so it suffices to show that Ax X = 0. If U is a section of V, then 


(U, AxX) = (U,V xX) = —(VxU,X), 
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where ( , ) is the inner product on T;,.%. Now, under our hypotheses, [X, U] is 
vertical, so (VxU, X) = (Vu X, X), hence 


1 
(U, AxX) = 5£u(X,X) =0, 


since (X, X) is constant on each integral curve C. 


Note that C'x is uniquely determined by (3.8)—(3.10), together with the fact 
that it interchanges V and H. 

We want to study the behavior of a geodesic on a stationary spacetime 7. We 
begin with the following result: 


Proposition 3.3. Let y be a constant-speed geodesic on M, with velocity vector 
T. If Z is a Killing field, then (T, Z) is constant on 4. 


Proof. We have 


d 
(3.11) ae (E (8) 2(0(8))) = (T, VrZ), 
if VrT = 0. Now generally the Lie derivative of the metric tensor g is given 
by (Lzg)(X,Y) = (VxZ,Y) + (X, Vy Z), so the right side of (3.11) is equal 
to (1/2)(£zg9)(T,T). Since Z is a Killing field precisely when Lzg = 0, the 
proposition is proved. 


Thus, if 7 is a geodesic on M, satisfying 


(3.12) (T,T) = Ca, 
we have 
(3.13) (T,Z) =C\. 


There is the following relation. Set 
(3.14) T=T4+T%=aZzZ+T, 


where To is a section of Y and 7} a section of H. Then, by orthogonality, Co = 
a?(Z, Z) + (T;,T1), while (T, Z) = a(Z, Z) = Cy, so 


Ci 


(3.15) Cy = (ZZ) 


+(T,T1), a= 


In Einstein’s theory, a constant-speed, timelike geodesic in MM represents the 
path of a freely falling observor. Let us consider the corresponding path in “space,” 
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namely, the path a(s) = po y(s), where p: M —> Sis the natural projection. We 
want a formula for the acceleration of o. 

Note that if y/(s) = T = To + T; = aZ + Tj, as in (3.14), then o’(s) = V(s) 
is the vector in T,,’,)S whose horizontal lift is 7) (s). By slight abuse of notation, 
we simply say V(s) = T,(s). Similarly, 

(3.16) VeV =PVernh, 


where P, is the orthogonal projection of T,,M on Hz, x = y(s). We can restate 
this, using a modification of the Levi—Civita connection V on M to V, given by 
(3.5). Then, via the identification used in (3.16), we have 


(3.17) VEV =Vrl = —CrTo, 
using VT = 0. In fact, this plus (3.5) yields 


VeV = Voth CrT, CrTo, 


where the first two terms on the right are sections of V and the last term is a section 
of H. Thus we get (3.17), plus the identity VrpTo = —C rT}. 

Consequently, if U; is a vector field on M, identified with a section of H on 7, 
we have 


(VEV, U1) = —(CryTo, U1) + (To, Cr,U1) 
(3.18) 


—(II¢(To, To), U1) — (To, uP, U;)). 


Here, IIc is the second fundamental form of the integral curve C of Z, and w is 
the “bundle curvature” of M — S, as in (3.1). The first identity in (3.18) makes 
use of (3.8), while the last identity follows from (3.9) to (3.10). 

Consequently, if we define wr, : H > V by wr, UV, = w(T1, U1), with adjoint 
wip, : V > H, we have 


1 
(3.19) ViV =—Ile(T, TM) — 50 (Ti, To); 


where w'(T1, 7) = wp, To. Note that the formula (3.2) for Zc can be rewritten 
as 


1 
(3.20) HIc(Z,Z) = 5 grad@, 6 = —(Z,Z), 


where (Z, Z) is a smooth function on M, constant on each integral curve C, hence 
effectively a function on S. Thus 


1 
grad ®, 


2 
(3.21) IIc(To, To) = a7 IIe (Z, Z) = a oD 
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where C| is the constant C = (7, Z) of Proposition 3.3. 
We can rewrite w'(T), 7) as follows. Let 8 : H — H be the skew-adjoint 
map satisfying 


(3.22) w(T),U;) = (B(T1), U1) Z. 
We then have 
(3.23) w*(T,T) = Ci B(T1), 
using the identity a(Z, Z) = C; from (3.15). Note that effectively @ is a section 
of End 7’S, that is, a tensor field of type (1,1) on S. 

In summary, recalling the identification of V and 7), we have the following: 
Proposition 3.4. If y~ is a constant-speed, timelike geodesic on a stationary 


spacetime M, then the curve 0 = po y on S, with velocity V(s) = o'(s), has 
acceleration satisfying 


1 1 
(3.24) VeV= 5Ci grad 1 — 5O3(V). 


Note a formal similarity between the “force” term containing G(V) here and 
the Lorentz force due to an electromagnetic field, on a Lorentz 4-manifold. Given 
initial data for y(s), namely, 


(3.25) 70) =%0€M, = ¥'(0) = T(0) = T(0) + T1(0), 

we have C = (T9(0), Z(ao)). The initial condition for o is 

(3.26) o(0) = p(%o), (0) = To(0). 

Conversely, once we obtain the path o(s) on S, by solving (3.24) subject to the 


initial data (3.26), we can reconstruct y(s) as follows. We define T on the surface 
= = p '(a) so that 


(3.27) p(x) = a(s) T(x) =a(s)Z+ V(s), 


with a(s) specified by the identity (3.15), namely, 


-1 


(3.28) a(s) = —C,®(o(s)) 


Then T is tangent to & and 7¥ is the integral curve of T' through xo. 

The Lorentz manifold / is said to be a static spacetime if the subbundle 1 is 
integrable, that is, the bundle curvature w of (3.1) vanishes. Note that if ¢ is the 
1-form on M obtained from Z by lowering indices, then 
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(3.29) (d¢)(X,Y) = X(Z,Y) —Y(Z, X) — (Z,[X,Y]). 
If X and Y are sections of H, this gives 

(3.30) (Z,[X, Y]) = —(dc)(*,Y), 


so vanishing of d¢ on H x H is a necessary and sufficient condition for inte- 
grability of H. As a complement to (3.30), we remark that, since Z is a Killing 
field, 


(3.31) (X,d®) = (d¢)(X, Z), 


for any vector field X on M, where, as in (3.20), ® = —(Z, Z). This follows 
from the identities 


(L26\(X) = (Lzg)(4,X), £20 = d(C] Z) — (dc) |Z. 
If M is static, a calculation using (3.30)—(3.31) implies 
(3.32) d(o-*¢) =: 
Hence there is a function t € C'°(M) such that 
(3.33) ¢=-® dt. 


It follows that the tangent space to any three-dimensional surface {tf = c} = S, 
is given by T,S. = Hp, for p € S_, and furthermore, the flow Ff generated by 
Z (which preserves 7) takes S, to S.44. Each S, is naturally isometric to the 
Riemannian manifold S, and the metric tensor on M has the form 


(3.34) ds? = —®(x) dt? + gs(dzx, dz), 


where ® is given by (3.20) and gg is the metric tensor on S. 

So, when M is static, we obtain a diffeomorphism UV : S x R > M by 
identifying S with Sp = {t = 0} and then setting U(x, t) = Fix. The geodesic 
yon M yields a path on S x R: 


(3.35) WY *(7(s)) = (o(s), t(s)), 


where o(s) is the path in S studied above. The function ¢(s) is defined by (3.35). 
Note that 


dt 
(3.36) a a(s), 
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where a(s) is given by (3.27)-(3.28). Thus we can reparameterize y by t, obtain- 
ing ¥(t) such that 7(t(s)) = 7(s). We see that 


(3.37) A(t) = (28), 2), 2G) =o(s). 


The quantities v(t) = x’(t) and a(t) = V2 v(t) are the velocity and accelera- 
tion vectors of the path a(t). We have 


(3.38) u(t) = a'(t) = ae) V(s) = rar a 
Furthermore, 
@? led 
Ss. s 
(3.39) Vee = aaViV — & (oe). 


Note that d®/dt = (v, grad ®). If we use (3.24), recalling that 3 = 0 in this case, 
we obtain the following result: 


Proposition 3.5. A static spacetime M can be written as a product S x R, with 
Lorentz metric of the form (3.34). A timelike geodesic on such a static spacetime 
can be reparameterized to have the form (3.37), with velocity v(t) = x'(t), and 
with acceleration given by 


(3.40) Veu= -5 grad ® + 5 (v. grad )v. 


By (3.15) we have (V, V) = C2 + C?/®, hence 
(3.41) (u,v) = 6+ 6", 


In particular, if y(s) is lightlike, so Cz = 0, we have 
(3.42) (u,v) =. 


This identity suggests rescaling the metric on 5S, that is, looking at g* = 6~!gg¢. 
We will pursue this next. 

Note that the null geodesics on a Lorentz manifold M (i.e., the “light rays’), 
coincide with those of any conformally equivalent metric, though they may be 
parameterized differently. This is particularly easy to see via identifying the 
geodesic flow with the Hamiltonian flow on T* M, using the Lorentz metric to 
define the total “energy.” If M is static, we can multiply the metric (3.34) by ®~}," 
obtaining the new metric 


(3.43) ds? = —dt? + g* (dz, dz), gt =O" 1gg. 
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If y is a geodesic for this new metric on M, the equation (3.40) for the projected 
path x(t) on S becomes 


(3.44) Viv =O, 


as the @ = 1 case of (3.40). Consequently, null geodesics in a static spacetime 
project to geodesics on the space S, with the rescaled metric g* = ®~ gs. 

Let us see what happens to geodesics that need not be lightlike. For the 
moment, we take M to be stationary, and define ® by (3.20). In order to clar- 
ify the role of the exponent of ®, we consider on S' a conformally rescaled metric 
of the form g#* = 6° gg. Farther along, we will again take a = —1, and then we 
will specialize to the case of M static. 

The connection coefficients for the two metrics gs and g* are related by 


BAS) AIM ce = ST ne + 55 (OD) He + (eB) He — 9!(Oy) ge) 


Equivalently, the connections V° and V* are related by 


Viw =VEW + (Wy, grad &) W + (W, grad &)V 
(3.46) 2 


—(V,W) grad °), 
In particular, 
(3.47) ViV =VEV+ a” grad &)V — sg Vs V) grad ©. 


Here, ( , ) is the inner product for gg, and grad ® is obtained from d® via the 
metric gg. 

If +(s) is a geodesic (not necessarily lightlike) on a stationary spacetime M, 
then the construction of the projected path o(s) on S given by o = po y shows 
that V = o’(s) satisfies 

Ci 
(3.48) VV) =O+ Sh, 


as noted after Proposition 3.5. Hence, in the lightlike case, g#(V, V) = ®*-1C?. 
If we want to reparameterize o to have constant speed (in the lightlike case), we 
set 


(3.49) a(r) = a(s), —=o0-9 


so g# (a', a’) = C? if y(s) is lightlike. Let 


3. Stationary and static spacetimes 687 
(3.50) w =6'(r) = BO-V/PY(s). 


Then (regardless of whether ¥ is lightlike) 


_ 
(3.51) Vitw = 0'-* V#V + 5b" (grad BLV)V. 
If we use (3.46), this becomes 


l-a Sy a -_ a 
a © (viv = (V, grad ®)V ~ =< (V,V) grad °) 


i 
+ SP "(grad ®,V)V. 


If we use (3.48) for (V, V) and (3.24) for VeV; we see that (3.51) is equal to 


Cc Cc 
ape grad & — so" (C2 + =) grad 


1 C 
ze oy, grad ®)V — 61" 9(V). 
From here on, we take a = —1. We have 
C. C 
(3.53) Vitw= a grad & — > Phu). 


This is a generalization of (3.44). If / is static, 6(V) = 0, and if + is lightlike, 
Cz = 0, so Vitw = 0 in that case. 

Note that if 1 is static, then w is a constant multiple of v, defined by (3.38), 
and (3.53) is then equivalent to 


C2 
(3.54) Vitv = —5 grad ©. 
2G? 


In (3.54), grad ® is obtained from d® via gg. If we use g*, call the vector field so 
produced grad* ®. We have 


grad* f = ® grad f. 
We have established the following: 
Proposition 3.6. Let M = S x R be a static spacetime, with metric of the form 
(3.34). Let y be a timelike geodesic on M, reparameterized to have the form 


(3.37), yielding a curve x(t) on S, with velocity v(t) = 2'(t). With respect to the 
rescaled metric g* = ®~'gg on S, this curve satisfies the equation 
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C2 
(3.55) Vitv = 32 grad* ®. 


Recall that C, and C2 are given by (3.12)—(3.13). We mention that, while the fac- 
tor C/ G? is constant on each orbit, it varies from orbit to orbit. For example, if 
at time to a particle moving along 7¥ is “at rest,’ so y/(so) is parallel to Z, where 
t(so) = to, then C2/C? = —®(a9)~', where x9 = 2(to), as follows from (3.15). 


Exercises 


Exercises 1-5 deal with the Kerr metric, which, in (¢, r, 0, ¢)-coordinates, is 


2 a) 
(3.56) ds? = -4 [dt—asin? yp d0]?+% dr?-+p? dig +" [(r?+a?) d0—a dt]’, 
where 
A=r—Kr+a’, p =r’ +a’ cos’ y. 


Here, K > 0 and a are constants. Note that the case a = 0 gives the Schwarzschild 
metric (2.61). 
1. Show that the Kerr metric provides a solution to the vacuum Einstein equation 


Gin =0. 


2. Show that 0/0¢ is a Killing field and hence that the Kerr metric is stationary. 

3. Show that the Kerr metric is not static (if a 4 0). Compute the 2-form w of (3.1). 

4. Contrast the metric induced by (3.56) on surfaces t = const. with the three-dimensional 
Riemannian metric constructed at the beginning of this section. 

5. Try to provide a “simple” derivation of the metric (3.56). 


4. Orbits in Schwarzschild spacetime 


We want to describe timelike geodesics in the Schwarzschild metric, which (for 
r > K) isa static, Ricci-flat metric of the form 


(4.1) ds? = —®(r) dt? +. ®"\(r) dr? +r? du®, @(r) =1- 


where dw? is the standard metric on the unit sphere $7. This is of the form (3.34), 
with ® depending only on r, and gs(dx,dx) = ®(r)~1 dr? + r? dw. Thus, by 
Proposition 3.6, a timelike geodesic (reparameterized) within the region r > K 
has the form 4(t) = (a(t),t), where v(t) = 2’(t) satisfies the equation (3.55), 
namely, 


(4.2) Viv =—o grad® 6, o =-—% 
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This involves the rescaled metric g# = &~'gg, which has the form 
(4.3) g* (dx, dx) = ©(r)~? dr? + ®(r) 1 r? dw. 


Symmetry implies that «(t) is confined to a plane, so we can restrict attention 
to the associated planar problem, with 


gt (dx, dx) = ®(r)~? dr? + ®(r)~1r? dé? 


44 
on = a(r)—* dr? + B(r)—' dé?. 


We use the method discussed in § 17 of Chap. | to treat this problem. We have a 
Hamiltonian system of the form 


OF OF 
4.5 4=—, M=—-—, m=0 
( ) Yj On,’ ™m Oy,’ 12 ; 
with 
1 2 1 2 
(4.6) Fy, 1,12) = yoy) + gA(y)n2 + o%(y1), 


where yi = r, y2 = 9. The first set of equations in (4.5) yields 


O(r) 


re? 


(4.7) * = O(r)?m, 6=L 


where L is the constant value of 72 along the integral curve of (4.5). Now 
F(yi1,m,L) = E is constant along any such integral curve. Solving for 7 and 
substituting into the first equation in (4.7), we have 


-®(r) (28 —2¢@(r) — 122) ‘a 


(4.8) r= 


We can rewrite this as 


bak =o. 


2Ko «6? KL? \ 1/2 
+=)", 


(49) %=+(r) (2+ ei aa 


Compare this with (17.16) of Chap. 1: 


(4.10) p= 


for the Kepler problem, with potential u(r) = —K/r. We have a shift in E, a 
correspondence Ko ++ K, and an extra term K L?/r?, within large parentheses 
in (4.9). 
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Next, setting u = 1/r, we have 


dr 2 du dé 


parallel to (17.21) of Chap. 1, and using § = L®(r)/r? we have 


dr du 
(4.12) Tz LO&(r) 70: 


hence, via (4.9), 


du 1 


ey 1/2 
(4.13) a=F7 (28 4 2Kou— Lu? + KI*u*) 


Compare the Kepler problem, where dr /dt = —Ldu/d0, and hence, from (4.10), 


du _ 1 9 9\V? 
(4.14) 5 7 (28 12Ku—L uw) 


It is useful also to consider a second-order ODE for u = u(0). Squaring (4.13) 
and taking the 0-derivative, we obtain (either u’(@) = 0 or) 


d?u Ko 3 
(4.15) Wee 7 U 


I 
i 
se 


Following [ABS], p. 207, we write this as 


d2 
(4.16) ape tua Aten. 
The € = 0 case arises from the study of the Kepler problem; cf. (17.24) of Chap. 1. 
A phase-plane analysis of (4.16) is useful. If v = du/d0, we have the 
“Hamiltonian system” 


du dv 2 
(4.17) 77 =v=0,F (u,v), 70 =A-ut+eu = —-0,F (u,v), 
where 
_1. 1. E 3 
(4.18) F(u,v) = 5 tau Au roe 


Of course, orbits for (4.18) lie on level curves F'(u, v) = E1, bringing us back to 
(4.13). See Figs. 4.1-4.4. In these four figures we have, respectively 
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3 3 1 1 
= A =, =<A -, Ae=-. 
e=0, 0< E< i@° 6 ~ e<7) E Z 


Also, in Figs. 4.2-4.3, we have 


1—V1—4Ae | aA 

Qe “da tA 
14+ V1 —4Ae 

Qe , 


B= 


We perceive the periodicity of u as a function of , on those level curves dif- 
feomorphic to the circle. The period is not 27, generally, if ¢ 4 0, so we have 
precession of the perihelion. 

Not all the closed orbits for (4.33) depicted in Figs. 4.24.4 correspond to 
bound orbits for the solution x(t) to (4.2). One mechanism behind this arises 
already in the e = 0 case. Take a level curve in Fig. 4.1 which crosses the vertical 
axis {u = 0}. Now u = 0 means r = ov, so as u > 0 along such an orbit, x(t) 
tends to infinity; this endures for an infinite span of time. Such situations also arise 
for positive values of e. 

In addition, in the case under consideration in this section, there is another 
mechanism at work. Namely, consider a level curve that crosses the vertical line 
{u = 1/K}, as in Fig. 4.5. From (4.9) we see that as u 7 1/K (sor \, &), 
t / oo. Now, in this case, it does not take the body “forever” to cross the thresh- 
old. When one switches to Eddington—Finkelstein coordinates (or to Kruskal 
coordinates), one can see the planet entering the zone r < Kk in finite “proper 
time.” The analysis of the geodesic within this region is not radically different 
from that done above, though there are some differences, since in this region the 
Killing field 0/0¢ is not timelike. We leave it to the reader to work out the nature 
of the orbit in this region, but note that indeed, a body crossing the threshold will 
not be able to exit. 

Let us look at the problem of determining the period p(<) of a solution w = 
w(e, @) to an ODE of the form 


2 
(4.19) a 4+w=ev(w). 


This is related to (4.16) by w = u— A, w(w) = (w + A)?. To specify uniquely 
a solution, let us take 


d 
(4.20) w(e,0) =a, 77 w(e,0) = 0. 


If v = dw/d@ (and we denote this by w), then we have the system 


(4.21) w=v, b= —wtev(w), 
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FIGURE 4.1 Orbits for (4.16), « = 0 


FIGURE 4.3 Orbits for (4.16), 4 < Ae < } 
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FIGURE 4.4 Orbits for (4.16), Ae = 4 


(VN 
\ 


FIGURE 4.5 Crossing the Threshold 


as in (4.17), and orbits lie on level curves of the function F:, given by 


(4.22) F.(w,v) = sO + sw —eV(w), WU(w)= [ew dw. 


It is clear that, for ¢ small, we have smooth, real-valued functions p(c,@) and 
v(e, 0), uniquely specified by 


(4.23) w + iv = ple, de), 


By (4.20), we have 
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(4.24) p(e,0) =a, y(e,0) = 0. 
Note also that 
(4.25) p(0,0) =a, (0,0) = 0. 
Then p(e) is a smooth function of ¢, satisfying 
(4.26) p(0) = 2n, y(e,p(e)) = 2m. 

We can derive a system of ODE for p, y, as functions of 9. The system (4.21) 
implies 

w+id =v —iw + iew(w) = —i(w + iv) + tew(w). 


Writing the left side as (d/d0)(pe~*”) = (p — i~p)e~*” and the right side as 
—ipe*? + iey(pcos py), we obtain 


p = —e(pcos) sin », 


(4.27) : 
p=1-—ep ‘w(pcos ~) cos y. 


A particularly significant quantity we can compute is p’(0). By (4.26), we have 


Oy Oy 
4 —_ — = 
p(0) 5(0,2n) + © (0,2n) = 0, 
so 
dp 
i —— 
(4.28) p (0) = De (0, 27). 


To compute the right side of (4.28), write 

(4.29) p=atep,(O™)+---, p=O+ey,(6)+--: 
Substituting into (4.27), we obtain 

(4.30) pi = —vU(acos@)sin@, g, = —=ui(acos 0) cos 0. 


Also, from (4.24) we have y1(0) = 0, so 


27 


(4.31) p'(0) = —y1 (27) = - : w(acos @) cos 6 dé. 


In the case 7)(w) = (w + A)”, we can evaluate the integral, to get 
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(4.32) p' (0) = 27 A. 


The integral that arises in (4.31) is often interpreted as an average, and the calcu- 
lation above is sometimes said to involve the “method of averaging.” For more on 
this topic, one can see [SV]. 

The accuracy achieved when these formulas were applied to the calculation of 
the precession of the perihelion of the planet Mercury-or rather that part of it not 
attributable to perturbations produced by the other planets—provided early positive 
evidence in favor of Einstein’s theory of gravity. 

We end this section with a remark on the value of K in (4.1), in terms of 
Newtonian concepts. Newtonian theory should be accurate for a planetary orbit 
on which r is large and the velocity of the planet is small. Let us evaluate the 
quantity o, defined in (4.2). We have 


— Cy _ 1 (7,T) 
oe) = 302 ~~ 2 (FZ)? 


by (3.12)-(3.13). Now, “small velocity” means T is essentially parallel to Z; 
hence (T,T) ~ —(T, Z)”, and so 


(4.34) on x. 


If also r is large, then (4.9) becomes 


a ee 2 
ss pw a(i- 4)" (oe+£- 2)” 


K [?2—-2EK?4+ _ 1/2 
; 


mw + [2# + (1-46) 


r r 


For the Newtonian approximation to be valid, we need E << 1. Also Ae << 1, 
with Ae = 30 K?/(2L7); in other words, K? << L?. Thus, 


(4.36) r 


x 
— 
i) 
my 
| 
| 
| 


If we compare this with the formula (4.10) arising from the Kepler problem, we 
see one difference; (4.10) has 2K instead of K. Since, in appropriate units, the 
ix in (4.10) is the gravitational mass of the attracting body (e.g., the Sun), we 
conclude that in (4.36), AK should be twice the gravitational mass. Thus, it is 
common to write the Schwarzschild metric as 


(4.37) ds? = -(1 as ) dt? + (1 as a dr? + 7? du? 


r 


and identify M/ as the “mass” of the solution, as seen at infinity. 
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Exercises 


1. In Fig. 4.3, consider the orbit with (3, 0) as forward and backward limit points. Assume 
8 < 1/K. Interpret the behavior of the corresponding solution x(t) of (4.2). 

2. Study timelike geodesics for the metric (4.1) inside {r < K}. 

3. Study the behavior of timelike geodesics on a spacetime with a Kerr metric, given by 
(3.56). One might consult the treatment in [Chan2]. 


5. Coupled Maxwell—Einstein equations 
The coupled Maxwell—Einstein equations are 
(5.1) Gyn = 8nKTjx, dF=0, d*F=0, 


in a spacetime in which there is an electromagnetic field F, but no matter. The last 
two equations in (5.1) are the Maxwell equations, discussed in § 11 of Chap. 2. 
The stress-energy tensor of F is given by (1.4), that is, 


1 £ 1 ie 
(5.2) Ty = = (Fi'Fee — 5 Fue F“ gin). 

We look for spherically symmetric solutions to (5.1). Thus, as in § 2, we first 
take the metric to have the form (2.4), so that Gz, is given by (2.51)—-(2.55). The 
hypothesis that F is spherically symmetric restricts its form severely. In fact, we 
can write 


F=dA+ct,r)o, 


where A is a 1-form and a is the standard area form on $?. The equation dF = 0 
implies c(t,r) = c, constant. If we assume the electromagnetic field decays 
to zero as r —> oo, then c = 0. We will make this hypothesis. By averaging 
with respect to the SO(3) action, we can arrange that A be invariant under this 
action. This implies that, for each orbit O of SO(3), the pull-back j5.A € A'(O) 
vanishes. Indeed, A'(O) has no SO(3)-invariant elements other than zero; equiva- 
lently, the sphere S ? has no SO(3)-invariant vector fields, other than zero. Hence, 
A has the form 


(5.3) A = a(t,r) dt + b(t,r) dr, 
SO 
Ob Oa 
(5.4) FS (F = =) dt \ dr = E(t,r) dt A dr. 


Thus, the only nonzero components of F;;, are Fo, = —Fy)o. 
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We deduce that all off-diagonal components of 77; vanish and that 


1 i 

AnToo = ~e7*E?, AnT,, = —~e"E’, 
2 2 

(5.5) ‘ 

4nT;; = 50 OE 933 fg = 2.3. 


In particular, since T\) = 0, it follows that G'1g = 0, so (2.52) implies 0A/Ot = 0, 
that is, \ = A(r). If we exploit Goo = 8m7KT 9 and G11 = 87KT)1, using (2.51) 
and (2.53), we get 


Vv Vv 


6.6) wE(t,r)? =-S (1-1, -e*) =-S (+m -e’). 


In particular, 0,.(A + v) = 0. Thus, as in the argument following (2.56), we can 
fix the t-coordinate so that v + A = 0, and hence the metric is again in the static 
form (2.57): 


(5.7) ds? = —e”) dt? + e—Y dr? +r? du?. 


Now the right side of (5.6) is a function of r alone, so EF = E(r); that is, the 
electromagnetic field F has the form 


(5.8) F = E(r) dt adr. 

Then the equation d* F = 0 is equivalent to 0,(,/—gF°!) = 0, where g = 
—e’e "rr? = —r4, so we have 

(5.9) E(r) = = 


for some constant q. If we substitute this formula for F in (5.6), we obtain the 
ODE 


(5.10) rv! (r) = (1 = “yew = 


d oy 
(5.11) (r= + 1)y@) =1- 4, 
dr r? 
a nonhomogeneous Euler equation with general solution w(r) = 1 — K/r + 
Kq?/r?, so 
re 


(5.12) MM=o1-—+4, 
Tr r 
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where we have set Q? = xq. Hence we obtain the metric 


2 
(5.13) ds? = (1 re 


r 


K 2\-1 
) dt? 4 (1 + 2 ) dr? + r? dw?. 
r r 


This is known as the Reissner—Nordstrém solution. It becomes the Schwarzschild 
solution when Q = 0. 

It remains to check that G22 = 87KT>2, which by (5.5) is equal to KE(r)?r? = 
Kq? /r?. In this case, (2.54) yields 


so we need 


2 2Kq? 
(6.14) U(r) + u(r)? + <v'(r) = be 
r 


This can be obtained from (5.10) in a fashion similar to the deduction of (2.60) 
from (2.58), so we can conclude that (5.8), (5.9), (5.13) is our desired spherically 
symmetric solution to the Maxwell—Einstein equations (5.1). 


Exercises 


1. Using the method of §§3 and 4, study the timelike geodesics (i.e., possible paths of 
an uncharged particle in free fall) for a spacetime with the Reissner—Nordstr6m metric 
(5.13). 


Exercises 2-4 deal with the Kerr-Newman metric, given in (t, r, 0, y)-coordinates 
by 
2 


(5.15) ds? = ~ 5 [atasin® yd]? o dr?-+p? dy? + 


where 
A=r’?—Kr+a@+Q’, p? =r? +a? cosy. 
Here, K > 0, a, and Q are constants. Note that the case a = 0 gives the Reissner— 
Nordstrém metric (5.13) while the case Q = 0 gives the Kerr metric (3.56). 
2. Show that (5.15), together with 
Fal * cos” y) dr A (dt — asin” y dO 
= —(r° — a’ cos” p) dr A (dt — asin” ¢ dé) 
p 
(5.16) 2Q 
+ eo y)(sin y) dy A [(r? +a’) do — adt], 


provides a solution to (5.1). 
3. Show that (5.15) is stationary, but not static, if a 4 0. 
4. Study the timelike geodesics on a spacetime with the metric (5.15). 
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6. Relativistic fluids 


In general relativity, the motion of an ideal fluid is governed by Einstein’s equation 


(6.1) Gx = 8TKTjx, 
where T’;;, has the form 
(6.2) Tyk = (9 + p)ujur + D9 5K- 


This is the stress-energy tensor of a fluid, with 4-velocity u, satisfying (u, u) = — 
1. The pressure is p, and the density is p, both of these quantities being measured 
by an observer traveling with velocity u. 

The condition that div G = 0 leads to fluid equations. In fact, a computation 
gives 


(6.3) div T = (p+ p)VuutLu(p+p) ut (pt p)(div u)u +t grad p. 


Note that, since (u, wu) = —1, u L Vu. Thus we can separate (6.3) into compo- 
nents orthogonal to and parallel to u and conclude that div T = 0 if and only if 


(p + p)Vuu = —II(u) grad p, 


6.4 
on div(pu) = —p div u, 


where II(w) denotes projection orthogonal to u with respect to the Lorentz metric. 
The case p = 0 is that of a dust; then (6.4) reduces to 
(6.5) V,u=0, div(pu) = 0. 


For an isentropic fluid, the pressure p is a function of p, so there is an equation of 
state 


(6.6) p= pip). 


One can compare (6.4) with the nonrelativistic fluid equations, (5.12)—(5.13), 
of Chap. 16, which are, (with X”(p) = Vp/p), 


0 
em (5 + Vow) = —Vp, 


— + div(pv) = 0. 
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This is an approximation to (6.4) when gj, ~ 1; (the Minkowski metric), u * 
(1,v), |v] << 1, and |p| << p. 

As in the study of nonrelativistic fluids in Chaps. 16 and 17, it is of interest to 
consider the vorticity of a fluid flow. First, if u is the 1-form corresponding to u 
via the Lorentz metric, we consider the 2-form 


(6.8) €= di. 


We can express this in terms of the linear map 


(6.9) B,:7T,M>T1,M, 2X =—-Vxu. 

In fact, 

(6.10) (X,Y) = (Vxu, Y) — (Vyu, X) = —((By — B4)X,¥). 

In particular, € = 0 if and only if =,, = &*. Note that since (u, u) = —1, we have 


0 = X(u,u) = 2(V xu, u), so (E,X, u) = 0, that is, 


(6.11) =, 17M Sh, “p= (ag) 
We also define 
(6.12) Axi My. Av= aly. 


Part of the significance of A,, is in determining whether the subbundle © of TM 
is integrable, as shown by the following: 


Lemma 6.1. The bundle & is integrable if and only if Ay = A*. 


Proof. If X and Y are sections of ), then 

(6.13) a([X, Y)) = —dti(X,Y) = (©,X,Y) — (X,5,Y), 
the last identity by (6.10). By (6.11)-(6.12), we obtain 

(6.14) au([X,Y]) = ((Au — At) X,Y), 


whenever X and Y are sections of ©. The lemma follows, by Frobenius’s theorem. 


It is useful to remark that the following formula holds: 


(6.15) div u = —Tr Ay, 


whenever (u,u) = —1. To see this, pick {e; : 1 < 7 < 3} to give a local 
orthonormal frame field for 3. We have 
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3 3 
(6.16) div u = $0 (Ve,u, ej) — (Vutt,u) = S$" (Ve, ts 3); 
j=l j=l 
the last identity holding since 2(V,,u, u) = u(u, u) = 0. This gives (6.15). 


The vorticity is the vector field W, defined as follows. If w is the volume form 
on M (a 4-form), W is uniquely specified by the identity 


(6.17) tww = tA dit. 


Note that if we wedge both sides of (6.17) with « and use the anticommutator 
identity \gew + twa = (W,%)I, we obtain 


(6.18) (W,t) =0, ie, Wp € dp. 

We can restate Lemma 6.1 in terms of the behavior of W: 

Lemma 6.2. The bundle & is integrable if and only ifW = 0. 

Proof. By (6.13), we see that © is integrable if and only if 

(6.19) dii( X,Y) = 0, VX,Y €X,, 

for all p € M. If we pick the basis {fo = @, fi, fo, fa} of TM to be the dual 
basis to {u, e1, €2, e3}, and write dui(p) as a linear combination of f; A fx, we see 


that (6.19) holds if and only if dii(p) = @/A a, for some a € T>; in turn this holds 
if and only if @ A du = 0, which holds if and only if W = 0. 


We can derive a “vorticity equation,” via calculations parallel to those used in 
(5.21)-(5.26) of Chap. 16. First, (6.3)-(6.4) imply 


(6.20) (p+ p)Vuti+ (Lup) + dp = 0. 
Next, we have £L,,% — Vytt = (1/2)d(u, u) = 0, so 
(6.21) Lyi + Bi+dq=0, 
where, assuming p = p(p), 


dp Lup 


(6.22) dqg= ——, B= = 
prp p+p 


Lud: 
Applying the exterior derivative to (6.21) yields 
(6.23) Ly€ = —d( Bi). 


We next produce an equation for £,,(% A €). If we start with 
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Luli d €) =A Ly€ + (Lutt) AE, 
and use (6.21)—(6.23), we obtain 
(6.24) Lu(t A §) = —d(qé). 


We can also produce an equation involving £,,W. Note the following 
characterization of W, equivalent to (6.17): 


(6.25) (iA £)Aa= (W,a)u, 
for every 1-form a. Beginning with 
Liu(iA&) Na=L,(iAENa)-—&AEA La, 


and making a computation parallel to that of (5.25)-(5.26) of Chap. 16, one 
obtains 


(6.26) LW + (div u)W = —0(¢8), 


where the vector field 0(q€) is uniquely defined by 


(6.27) (0(qé), a)w = d(gé) Aa, 


for all 1-forms a. The equation (6.26) can be compared with (5.26) of Chap. 16; 
the primary difference is that the right side of (5.26) is zero. 

Note that —d(qé) = d(t% A dq) and —0(qé) = 0(% A dq), so another way to 
write (6.24) is as 


(6.28) L,(uA €) =a), 


prp 


and another way to write (6.26) is as 


(6.29) LW + (div uw =0(2A2), 


prp 


We can produce a vorticity equation of A. Lichnerowicz as follows. Note that 


(6.30) (Lu + Blt = e 7L,,(e%%), 
so (6.21) yields 
(6.31) Ly(e4%) + d(e’) = 0, 


and applying the exterior derivative gives 


6. Relativistic fluids 703 
(6.32) L£,2=0, Q=dw, w=eBt. 
Note that © = e4(€ + dq A i), and hence 
(6.33) UAQ= el ANE. 


We can rewrite the form (6.31) of the first part of the Euler equations (6.4), 
using £,,(e%t) = 1,,d(e%%) — d(e%). Thus (6.31) is equivalent to 


(6.34) by = 0. 


The second part of the Euler equations (6.4) can be rewritten as 


(6.35) div u= i 
p+p 
which is also equivalent to 
(6.36) div w = LyV(p), 
with 
(6.37) w=elu, W(p) = loge *4(p +p). 
This in turn is equivalent to 
(6.38) d* GH = LyWV. 


Note that if we multiply (6.34) by e% and then apply the exterior derivative, we 
obtain the following variant of (6.32): 


(6.39) Ly Qf = 0. 
One relation between u A E and 2 is given in (6.33). If the Euler equation, in 


the form (6.34), holds, we can deduce another relation, via the anticommutator 
relation \jly, + t,Aq = —I. Applying this to Q and using (6.33), we obtain 


(6.40) ty =O => 1 = -e4 (HA €). 
Putting (6.33) and (6.40) together, we get 
(6.41) Q=0<—UAE=0, 


whenever 1? satisfies (6.34). This enables us to prove the following: 
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Proposition 6.3. Assume u solves the relativistic Euler equation. Let S be a 
spacelike hypersurface, and assume the vorticity W vanishes on O C S. Then 
W vanishes on the union U of the integral curves of u through points in O. 


Proof. That Q vanishes on U/ follows from (6.41) and (6.39); applying (6.41) 
again, we have u A € = 0 onl, hence W = Oonl. 


Note incidentally, that, when 1,0 = 0, 


(6.42) Q = ety lyw. 

Next we will derive a second-order PDE for w. To do this, we use (6.32) and 
(6.38) to compute LIw, where LJ = —dd* — d* d. We have 
(6.43) ® = —d(LyW) — d*Q. 


It is convenient to write 


(6.44) UV = O(e74), e74 — —(w,w), 
so 
(6.45) Ly WV = —26'(-(w,w)) (Vww,w). 


Since (X,d(w, w)) = X(w, w) = 2(Vxw, w), we have 


(6.46) d(w, w) = 214,Vw. 

Similarly, 

(6.47) AV ww, Ww) = twV (Vw) + o(V,,w) Vw, 

so 

(6.48) A(LyV) = —28' tyV(Vwt) + A(w, Vw), 

where 

(6.49) A(w, Vw) = —28! yw) Vw + 46" (Vw, wv) tw. 


Hence we have the coupled system 


(6.50)  — 26! 1, V(V pd) + A(w, Vw) = —d*Q, = Ly N=. 


A computation using (6.44), (6.37), and (6.22) gives 
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p'(p) — 1 


Poa 
(6.51) or 


In component notation, Q = t~V(VwwW) is given by 

(6.52) Qe = (wi w*w;-k) 2 = www; + Roe, 
where Ry involves only first-order derivatives. Now 

(6.53) lwQ =0 => wi wy. — w wes. 
Using this, one sees that 


(6.54) Qe = ww" wes-n + Re, 
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where Re involves only first-order derivatives. Thus, we replace the system 


(6.50) by 


(6.55)  — 28'V?2, i + A(w, Vw) =—-d*Q, Ly» =0. 


The left side of the first equation in (6.55) is a second-order, quasi-linear 
operator acting on w; its principal symbol is scalar, and provided p’/(p) > 1 (ie., 
provided p'(p) < 1), it is hyperbolic, and every hypersurface that is spacelike 
for the Lorentz metric gj is also spacelike for this operator. Of course, we have 
—d*Q on the right, and a second equation involving w and 2. Since £L,,0 involves 
first-order derivatives of w as well as of , the question of well-posedness of the 
initial-value problem for (6.55) requires further investigation. Following [CBr3], 


we clarify this by applying V,, to both sides of the first equation. 
Since the operator V,, has scalar principal symbol, 


(6.56) Vwd*Q. = d®¥VyQ + Bo(w, Vu, VO). 


Meanwhile, 


(6.57) — (Vw)(X,Y) = (Ly9)(X, Y) — A(Vxw, Y) — 2X, Vyw), 


SO 
(6.58) Ly Q® =0 = Vyd*O = B(D?w, VQ). 


Thus we replace the system (6.55) by 


(6.59) V»(O - 28'V?, ,,)@ — B(D?w, VO) =0, Ly Q=0, 
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which is analytically more tractable. Note that the first equation here contains no 
higher derivatives on (2 than the first equation of (6.55). 

Now the fluid velocity is also coupled to the gravitational field, via (6.1)-(6.2), 
so all of these equations have to be treated simultaneously. We will discuss this 
further in § 8. In preparation for that, let us mention that the first equation in (6.59), 
when written in local coordinates, involves the metric tensor and derivatives of the 
metric tensor, up to third order. 

We next construct some examples of static, spherically symmetric solutions 
to (6.1)-(6.2), which provide models for stable stars. We look for a solution 
involving a metric of the form 


(6.60) ds? = —e”"") dt? + OW) dr? + r? dw?, 


and use 9 = t, 1 =r, as in § 2. For the fluid to be static, we need 


(6.61) wae"? yaw=u' =), 
so 
(6.62) Tyr = (0 + ple” bj00K0 + P95K- 


Using (2.51)-(2.55), we see that (6.1) is equivalent to the following set of equa- 
tions, recording Gj; = 87KT},; for j = 0,1, and 2, respectively: 


1—rd, —e* = —8rKpr7e?, 


(6.63) 1+rv, —e* = 8rxKpr’e, 


1 1 
Vp + =U + =(Yy — Ap) aura = l6mKpe?. 
r 


If we assume p and p are related by an equation of state, p = p(p), as in (6.6), then 
(6.63) is a system of three equations, in three unknowns: v(r), A(7), and p(r). The 
system can be simplified a bit. 

If we apply e*(d/dr)e~> to the middle equation in (6.63) and subtract r times 
the last equation, we get 
(6.64) v'(v' + ') = —16rKre*p’. 
Meanwhile, taking the difference of the first two equations in (6.63) gives 
(6.65) v'(v' +d’) = 8rKre*(pt p)v’. 


Comparing these two equations, we have 


(6.66) p(r)= 0 + p)u'(r). 
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It is instructive to rederive this last equation, as a consequence of the vanishing 
of Tos, In fact, if wu is given by (6.61), then div u = 0, so by (6.3) the vanishing 
of div T' is equivalent to 


(6.67) (p+p)Vuut+Lu(ptp)u+ grad p = 0. 


Now uw, = O,u? +14 ¢,u*, where IY ¢;, is given by (2.9). When wu has the form 
(6.61), then u®us, = uud.go = (Ogu? + I oou°)u®. Also, by (2.9), M¥o9 = 
UTI 99 + Bi qo, and by (2.10), B499 = 0, while (2.43) implies 


1 
(6.68) UT 9 =0, YEo9 = gure. 
Thus, when u has the form (6.61), 
kj 1 gee 
(6.69) ww. = gure Oj1- 


Now, in the static, spherically symmetric case, p = p(r) and p = p(r), so clearly 
L.(p +p) = u°do(e +p) = 0, and the only nontrivial component of the left side 
of (6.67) is 


1 
(6.70) ee 5 (? + p)y'(r)e* + p'(r)e~?. 


Thus we again derive (6.66). 
We can use (6.66) to eliminate v, from the second equation in (6.63). The 
result, together with the first equation in (6.63), gives a2 x 2 system for A(1) and 


p(r): 


i r 
N(r) = a 8rKre*p, 
r 
2 1-—e 
——)|(r) = — 


ptp r 


(6.71) 
— 8rKrep, 


under the hypothesis that p = p(p). It is common to replace X(r) by a function 
giving the metric (6.60) a form more resembling (2.61). We define M(r) by 


(6.72) en) = (1 - se 


i 


so that M(r) = r(1 — e~>)/2. The system (6.71) takes the form 


(p+ p)(M + 4rKr3p) 


(6.73) M'(r) =4nKr2p, p'(r) = r(r— 2M) 
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known as the Oppenheimer—Volkov equation. 
In the Newtonian limit, p << p, 4aKr3p << M,and 2M << r, and these 
equations become 


(6.74) M'(r) =4nxr7p,  p'(r) = —. 


In fact, this is precisely the equation for a static fluid in Newtonian mechanics, in 
which the force of gravity exactly balances the force due to the pressure gradient. 
In such a case, M(r) is the gravitational mass of the matter enclosed in the ball 
{|x| < r} in R®. The relation between density and gravitational mass, given by the 
first equation of (6.74) (in the limit when Newtonian mechanics applies) serves to 
identify the constant « in Einstein’s equation (1.1), with the gravitational constant 
of Newtonian theory. 

The Oppenheimer—Volkov system (6.73) has consequences significantly dif- 
ferent from the Newtonian approximation (6.74), for very dense objects. For 
example, it leads to theoretical upper bounds on the mass of a stable neutron star 
which are stronger than those obtainable from (6.74). Discussions of this can be 
found in [Str, Wa, Wein]. 

In treating (6.73), it is natural to set 14(0) = 0 and let p(O) = po run over a 
range of values. We assume that p’(p) > 0 in the equation of state, so p = p(p) 
in (6.73), with p’(p) > 0. Despite the vanishing of the denominator in the second 
equation of (6.73) at r = 0, there is no real singularity there. Indeed, one easily 
verifies that 


M(r) = por? + O(r), 


3 
QTK 
P(r) = Po — 3 (Po + po) (3P0 + po)? + O(r*), 


(6.75) 


with 9 = p(po). For a numerical treatment of (6.73), it is convenient to use (6.75) 
for r very small, and then use a difference scheme, to produce an approximate 
solution for larger r. 


Exercises 


1. Assume u(p) 4 0 and W(p) 4 0. Using (6.42), show that the linear span £L, of u(p) 

and W (p) is given by 
Lp = {v € TM : wQ = Of. 

Using (6.32), show that the resulting subbundle £ of TM is invariant under the flow 
generated by wu (in regions where wu and W are both nonvanishing). In light of this, 
derive analogues of the Kelvin and Helmholtz theorems, established for nonrelativistic 
fluids in § 5 of Chap. 16 and § | of Chap. 17. 

2. Consider a static, spherically symmetric, charged fluid and associated electromagnetic 
field. Discuss the equations of motion. 
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3. Compute the second terms in the power-series expansions of M(r) and of p(r) about 
r = 0 in (6.75), namely, the coefficients of r° and of r+, respectively. 

4. Write some computer programs to solve numerically the Oppenheimer—Volkov system 
(6.73), with initial data 14(0) = 0, p(0) = po. Try various equations of state, such as 


(6.76) plo) = ke”, 
with & = const., used in models of white dwarf stars. For another example, fix po € 
(0, co), and use 
pip) = 4p, — forp> po, 


(6.77) 
p(p) =kp'’?, for p < po, 


with k picked so po/3 = kpal? (ie, k = po /?/3). 
See [Str] and [Wein] for discussions of variants of (6.77) used in models of neutron 
stars. 

5. Suppose the equation of state were 


(6.78) p(p) = § 


for all p € R*. Produce a solution to (6.73) of the form 
(6.79) M(r)= Ar, p(r)=Br™, 


for certain constants A, B. Relate this to the assertion that (6.78) cannot be a realistic 
equation of state at low density. 
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In many cases, solutions to Einstein’s equations, particularly coupled to matter, 
develop singularities in finite time, sometimes as part of the phenomenon of grav- 
itational collapse. We begin this section with some simple examples in which 
gravitational collapse occurs. 

Let us consider a homogeneous, isotropic universe, containing a fluid with 
uniform density and pressure. We write the metric as 


(7.1) ds? = —dt? + A(t)g°, 


where g” is a constant-curvature metric on a 3-manifold S. The stress-energy 
tensor has the form (6.2), with 


(7.2) p=p(t), p=p(p), uw=(1,0,0,0). 


We can compute the Einstein tensor of this metric using formulas from § 2. We 
have M = U x S, where dim U = 1 and dim S' = 3. From (2.22) we have 
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(7.3) Ricjz = Rich, + Fix, 


and F’;, is given by (2.28), with ¥ = log A(t). Hence (keeping in mind (2.20)) 
we have 


(7.4) eos ~$4-{ aa" = sayy, 
and, forl < 7,k < 3, 
I 2a nw 1 yng 
(7.5) Fix =—5A { AA" — 5(A’) of. 
By (2.29), the scalar curvature of MW is 
(7.6) S=A'Ss +8, 
where 
(7.7) pap", =34 A". 


Then, by (2.36), the Einstein tensor of M is given by 
g 1-4 1 

(7.8) GiR= Gir + 54 Sg 5j00k0 + Fir — 5 P9ik- 
In particular, 

14 1 1-4 3, a ane 
(7.9) Goo = —~A Sg+ Foo + =B= ~A Sgt -(A A’) ; 

2 2 2 4 
and, forl < 7,k < 3, 

Ss 3 WS Ss " 1 —lvqr2\_s 
(10) Gye = G8, + Fn — 5A" 99, = G9, — {A" — GAA) bo, 


Now, if S has constant sectional curvature (and hence constant scalar curva- 
ture Sg), then *Ric’;, must be a scalar multiple of 6/;,, and the multiple must be 
Sg /3, so 


1 
(7.11) C= — 5 SsGj: 


If Tj, is given by (7.2), then Einstein’s equations yield the following pair of 
equations for A(t) and p(t): 
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3/A’\2 18 
4 ( A ) 2 re eli 
(7.12) ; 
" 1 (A’) Sg _ 
A iA + 5 8rKAp(p). 


To put this in a slightly different form, we set A(t) = R(t)? and note that if 9 has 
constant sectional curvature K, then Sy = 6K. The we can rewrite (7.12) as 


(RYP+K= SrKpR?, 
2RR" + (R’)? + K = —8rKp(p)R?. 


(7.13) 


It is useful to perform some elementary operations on these equations. Note that 
taking the difference yields 


R” 
(7.14) 3 = —4rK(p+t 3p), 


while multiplying the first equation in (7.13) by 3 and taking the difference yields 


(7.15) 


4(B)-f-tminen 


On the other hand, applying d/dt to the first part of (7.13) gives 


dp R  Rd/R 
at) a a ae ), 


and substituting (7.15) into (7.16) then gives 


1 (R’), 


a ae 
gap qliek®) = Pa 


dt 
One can also deduce (7.17) from the identity T/ Me kz = 0 (with j = 0), in a fashion 
analogous to the derivation of (6.66) via (6.67)—(6.70). In turn, (7.17) implies the 
relation 
dR 1 dp 


7:18 = , 
es R 3 p+ p(p) 


which gives F as a function of p, or p as a function of R. 
Let us fix Ro = R(to) and po = p(to). We can now regard (7.14) as a dynam- 
ical equation for R: 


4 
(7.19) RY =—srKp(R), p(B) = (p+ 3p)R, 
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given p and p(p) as functions of R, and then the first part of (7.13) can be regarded 
as the conservation law: 


1 4 
(7.20) 5(R')° - gmnu(R) =—K, ~(R)=pR’. 
In other words, if we write (7.19) as a first-order system: 
/ / 4 

(7.21) R=V, VWe= —gtKvlR), 
then the orbits lie on level curves 

li». 4 
(7.22) F(V,R)=—-K, F(V,R)= a — gm (R). 


Thus we will examine these level curves. 
To do this, we look at (7.18), which gives 


p d¢ 
(7.23) R= Roe P)/3, i en 
: te) i 6 FPO 
If we assume that the equation of state satisfies 
(7.24) p(0)=0, p'(p) <1, 


then (if say p97 = 1) for p > 1, we have (1/2)logp < X(p) < loga, 
so Rop-/3 < R < Rop~'/®, with reversed inequalities for p < 1. Hence 
(Ro/R)? < p < (Ro/R)® for p > 1, so w(R) has the property 


Ro Ro 
(7.25) R= hy => R <¥(R) < Re 
Similarly, 

R& R3 
7.26 R> Ry = — < V(R) < —. 
(7.26) 2 Ro RS p(R) < R 


In Fig. 7.1 we depict the level curves of F'(V, R) and the resulting phase-plane 
portrait of the system (7.21). Note that all the orbits (R(t), V(t)) in the region 
V <0 have the property R(t) \, 0, V(t) \, —oo, as t increases. In particular, if 
V (to) < 0, then R’(t) is bounded away from zero for t > to, so R(t) must reach 
zero at a finite time t; > to! Similarly, if V(to) > 0, then R(t) must vanish for 
some finite t < to. Of course, at R = 0, p = +00, and the metric is singular. 
If K > O, one must have a singularity both at a finite time before tp and at a 
finite time after to. If kX > 0, there must be such a singularity either at some finite 
t < to or at some finite ¢ > fo. 
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K>0 
K=0 
R, 
: K=0 
( 


FIGURE 7.1 Orbits for (7.21) 


That such complete collapse must occur is not surprising in the case of a dust, 
where p = 0. However, it is striking that, given any realistic equation of state, the 
pressure cannot prevent the collapse to infinite density, even in the case K > 0 
and the total amount of matter in the universe is finite. 

One can cut and paste a portion of some of the spacetimes described above with 
a portion of Schwarzschild spacetime to give a model of collapse of a star, with 
spherical symmetry. The collapse of a rotating star is much more complicated. For 
further discussion, see [MS] and references given therein. It is worth mentioning 
the widely held belief that such a collapse, generally accompanied by gravitational 
radiation, should rapidly approximate a Kerr solution. 

There are a number of general results on the inevitability of gravitational col- 
lapse, accompanied by singularity formation. A detailed treatment is given in 
[HE], and we mention only one relatively simple case here. We show that under 
certain mild conditions, an irrotational dust must give rise to a singularity in space- 
time. We begin with a pair of geometrical lemmas. 


Lemma 7.1. [fu is a vector field satisfying 
(7.27) (u,u)=—-1l, Vyu=0, Ay = AX, 
then 


(7.28) L,,(div u) = —Ric(u, u) — Tr A2. 
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Proof. Let {e; : 1 < 7 < 3} bea local orthonormal frame field for ©, the bundle 
of 3-planes orthogonal to u. Then (6.16) yields 


(7.29) Ly, (div u) = (VuVe;U, ej) + (Ve; u, Vues): 


Here and below, we use the summation convention. Now write the first term on 
the right side of (7.29) as 


(VuVe,t, ej) = (Ve; Vutts €7) + (Vfu,es] tts 9) + (R(u, eg), €,) 
(7.30) 
= —(A,|u, air ej) = Ric(u, u), 
using V,,u = 0. If Ay = A%, we have 
(Vest, Vues) ~~ (Au[u, e,]; ej) = =(Aue;; Vue; T [u, e;]) 
= —(Ane;, Aue;) = 2(Aue;, Vues); 


(7.31) 


since [u, ej] = Vuej — Ve; u. 
The expression —(A,,e;, A,,e;) (summed over j) is equal to —Tr A? if AX = 
A,,. Furthermore, 


(Awej, Vues) = Aj(ex, Vues), 
where (A,;,) is the matrix of A,,, with respect to the basis {e,; }, which is symmet- 


ric. Since (e,, V ,e;) is antisymmetric in (j, k), we deduce that (A,,e;, V,e;) = 0 
(summed over 7). This proves (7.28). 


Recall from Lemma 6.1 that A,, = A*, is an integrability condition, and, by 
Lemma 6.2, it is equivalent to vanishing vorticity. Note that if A,, = A*, then 


— 


TA > g (Tr Ay)", 


as can be seen by putting A,, in diagonal form. Using (6.15), we can deduce the 
following: 


Lemma 7.2. Under the hypotheses of Lemma 7.1, 
ee 
(7.32) Lu (div u) < —3 (div u)* — Ric(u, wu). 


Proposition 7.3. Suppose M is a spacetime containing a dust, so the Einstein 
equations hold, with Tx given by 


(7.33) Tik = PUjUK, 


with p > 0 and (u,u) = —1. Assume that the motion of the dust is irrotational. 
Finally, assume that, for some p € M, 
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(7.34) div u(p) = —b < 0. 


Let y be the orbit of u (a unit-speed geodesic) such that y(0) = p. If y(7) is 
defined for r € [0,a), then a < 3/b. Furthermore, if a = 3/b, then 


(7.35) p(y(T)) + +00 and §$(y7(r)) ++00, ast fa. 


Proof. Under our hypotheses, Corollary 7.2 applies, so we have (7.32). Also, by 
(1.61), 


(7.36) Ric(u, u) = 47Kp, 


which is > 0. Hence, 


1.37) f(r) = div uly(r)) = £0) S ZF), 


so the hypothesis (7.34) implies 


3b 


(7.38) fis ac ae ee 


for 0 < r < a, provided f(7) is smooth on [0, a). This shows that a < 3/b. Also, 
if a = 3/b, then f(T) > —coas 7 a, in such a fashion that 


(7.39) i f(r) dr = -o0. 


To conclude the proof of (7.35), note that, given a dust, by (6.5) we have 
div(pu) = 0, hence Ly, + p div u = 0, or 


d 
L,,(log p) = —divu, ice., = log p(y(7)) = —f(r). 


Hence (7.39) implies the first part of (7.35). By (1.59) we have 
S = 81Kp, 


so the second part of (7.35) also holds. 


Stability issues in black hole formation 


A star goes through stages where it exhausts first its hydrogen, leading to con- 
traction of the core, heating, and ignition of helium, then it exhausts its helium, 
leading to further heating and subsequent fusion of carbon and heavier elements. 
A star significantly more massive than ours continues until its core exhausts all 
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the elements lighter than iron. Then fusion no longer releases energy, and gravity 
crushes the core, creating what is known as a type II supernova. The end prod- 
uct of this might be a neutron star, supported by the Fermionic pressure of the 
constituent neutrons. If the mass is great enough, even this cannot prevent further 
collapse, to a black hole. As we stated earlier, the standard scenario is that the 
collapse rapidly approaches a Kerr solution (described in (3.56)). Since the star is 
typically rotating, one expects a rotating black hole, rather than the Schwarzschild 
solution. The rotation is expected to slow over time, and one expects to see a 
smooth progression as a family of Kerr solutions, accompanied by gravitational 
radiation. 

Mathematical analysis of this situation raises many deep issues, one of the 
foremost being the issue of stability of various solutions to the Einstein equations. 
We mention some of the literature on this. 

The first breakthrough, in [CK], concerned the stability of flat Minkowski 
space. This got a book-length treatment. Eventually a shorter treatment was given 
in [LR]. On the other hand, a stronger stability result, valid for a larger class 
of perturbations, was given in [BiZ]. See also [Bi]. Further expositions of the 
formation and stability of black holes, and of gravitational radiation, include 
[Chr, BGY, Bi2], and [Gio]. Further structure of perturbed Minkowski solutions is 
analyzed in [HV2]. 

Stability of Schwarzschild black holes is treated in [DGIM], and stability of 
slowly rotating Kerr black holes in [HHV, KS], and [GKS]. The paper [HV] 
addressed the global nonlinear stability of the Kerr-de Sitter family of black holes, 
which solve the Einstein equations with a modification to include a cosmological 
constant. 

The issue of decay of waves, including linear waves, is closely aligned with 
stability issues, and has been attacked in [MTT, MT2], and [Tat]. 


Exercises 


1. Obtain more explicit solutions to (7.13) for a dust (1.e., when p = 0). (Hint: Use (7.17).) 

2. Assume p = 0. Let M have a metric of the form (7.1), by solving (7.12). Let B be 
a ball in the 3-manifold S, used in (7.1), and let Q be the subset of M formed by 
timelike geodesics through OB, orthogonal to S. Glue Q to an appropriate piece of 
Schwarzschild spacetime (whose boundary is also swept out by timelike geodesics) 
so as to model the collapse of a dust ball. In what sense do (6.1) and (6.2) hold on a 
neighborhood of the interface? 

3. Consider the behavior of a homogeneous, isotropic universe filled with a uniform fluid 
(i.e., consider solutions to (7.12)), when one does not require that the equation of state 
satisfy p’(p) < 1. 
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8. The initial-value problem 


To begin, we look at the initial-value problem for the empty-space Einstein equa- 
tions, which can be written as 


(8.1) Ric; = 0. 


In our first formulation of the initial-value problem, we try to prescribe, on a 
3-dimensional surface S = {a9 = 0} C M, the initial data 


(8.2) 95k] g = 95K. OogGik |g = Kyx- 


Here, 0p = 0/Oxp. Take M open in R*+, S = MM {xq = 0}. We will see shortly 
that compatibility conditions will be required on these data. In local (xo,... , 73)- 
coordinates we have 


=, ee 
Ric;, = ml [—OcOm9jx — O;OnGem 


(8.3) + OnOm 903 + 909; Gkm| ay M3x(9, V9) 


= Lix(g,D)g + Mjx(g, V9). 
Now it is easy to see that S = {a9 = 0} is characteristic for £. This results 
from the coordinate independence of the condition (8.1). We can get around this 
problem by choosing coordinate systems of a special nature. 


One way to do this is the following, used by [CBr2], following [Lan]. Rewrite 
(8.3) as 


te 1 1 
(8.4) Ricj, = =" O0m 95k + 9 Iie OME + a Ire Oj" + Hj (9, V9); 
where 
. _—a 
(8.5) MN = oT, = ao 9 (OkGjm + Oj 9km — OmGjk), 


and the I’ jk are the Christoffel symbols for the metric tensor ( Ij x), in the coordi- 
nates (2,..., #3). If 0 is the Laplace—Beltrami operator defined by the Lorentz 
metric (gjx), then 


(8.6) U= gr ss k = gi* 0;0,u — r Apu. 


Hence 


(8.7) —Or, = ». 
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In other words, if the coordinates x; satisfy Ux; = 0 (we call them “harmonic 
coordinates’’), then \ = 0. Thus, in a harmonic coordinate system, Ric;;, is equal 
to 


D 1 m 
(8.8) Rik = 59°" DeOmgijx + Hjx(9, V9). 
At this point we can solve the initial-value problem 
go" WOmgjk — 2H;x(g, Vg) = 0, 
(8.9) - ° 
95% | = Gon: Bo95%| g = Kye, 
as long as g jx defines a Lorentz inner product on T;,M, for which T;,S' is space- 
like, for each p € S. In such a case, this is a quasi-linear hyperbolic system, to 
which the results of Chap. 16, § 3, apply. We will also have a solution to (8.1), if 
we can show that \/ = 0. To that end, we establish the following: 
Lemma 8.1. If Ryx = 0, then » satisfies a system of PDE of the form 
(8.10) O90" + BY (g, Vg) OA” = 0. 


Proof. By (8.4) and (8.8), if Rix = 0, then 
. 1 Poe : 
Ricjk = 595¢ On” + 5 Ike 05%’, 


and hence 


1 ; ; 
(8.11) Ce 5 (O*\) + OA* — gO dN). 
Thus 
2GI* = (a*\ + OA") a g* (OeA*) x, 
and a straightforward calculation gives 
(8.12) 2GI* , = 0,0") + BI (g, Vg) Oer™. 
Since G?*., = 0, we have (8.10). 


We note for future reference that, without the assumption that Ryr = 0, we 
have 


— be Le 
(8.13) O,0*\7 + B2(g, Vg) XA” = —2T* 4, Tye = Rik — 5 egin- 
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Now, (8.10) is a linear hyperbolic system for \’. We can deduce that \7 = 0 
on a neighborhood of S if we have 


(8.14) M\,=0, OoA7|, =. 
Arranging this requires placing appropriate compatibility conditions on the 
Cauchy data (8.2). We now turn to this. 


From (8.5) we have 


0 £0 od 
e 1.40 0 [o} ° 
(8.15) M|g=h oo kj + F(g)Vsg, 


where the last term is linear in Vgg = (Arg, 2g, 39). Also, from (8.11) we see 
that 


(8.16) 24° x = Opr° + Gre Dor — 64 Oer, 
provided Ryr = 0. Consequently, 
(8.17) Ryp =0, "|, = 0 => 24s | 5 = 9n000"| 5. 


At this point it is convenient to record the following observation: 


et dhe_tod ; ° . : 
Lemma 8.2. The restriction G°), Fr is given in terms of gj, kjx, and their tan- 


gential derivatives; it does not involve 03.9;x- 
Proof. From (8.3) we have 


1... 
G°;, = 5919" (Om Ge; + 0,0; 9%m — WOmGjr — 9; 9nGem) 
— 36x97 9" (D:OmGe3 — O0Omg;i) + He(g, V9). 


(8.18) 


The contribution of the terms involving 03 is 1/2 times 


G93 Gg 5 06.905 + G99 83 Fem — 9°99"? OG; — Ong Gg” 93 9em 
— (8.9799 6.905 — ng 9” O6958)) 


which clearly vanishes. 


Let the resulting formula for G°;, | g be 


(8.19) eae = 9x (j4:D 29 5x) hie, Vokjn)- 


We now state a local existence result: 
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Proposition 8.3. Suppose the initial data (8.2) are C® on S and satisfy the con- 
sistency condition 


(8.20) GK (9;%: DS9jn1 kik, Vskjx) = 0. 


Assume S is spacelike for ee Then the initial-value problem (8.1)-(8.2) has a 
C™-solution on a neighborhood of S. 


Proof. Start with the tensor field 
(8.21) Gje(x) = 9jn(2") + vokjn(2’), x" = (a1, 22,28), 


which defines a Lorentz metric on a neighborhood of S. Then the Einstein tensor 
G?* of this metric satisfies (8.19) and hence G®;, bs =0. 


Now define smooth “harmonic” coordinates yo,..., y3, by solving 
(8.22) fy; = 0, 
with Cauchy data 
(8.23) Wig=Zilg ile = 4r5| gs 


where Clu is the analogue of (8.6) for the metric g;,. Rewrite the initial-value 
problem (8.1)-(8.2), in this new coordinate system. 
Then (8.2) takes the form 


é a 
(8.24) gin (0, 4’) = 954 (Y'), Dy Ory) = kj (y'); 


the functions 9; , are not changed, but the k;,, do undergo a change. Due to the 


tensor character of G2", we have 


(8.25) G (Gis DS Is» Kye V skin) = 0. 
Now solve the system (8.9) for g;x, in the (yo,..., y3)-coordinates, using the 
initial data (8.24). We claim that (yo,...,y3) are harmonic coordinates for the 


Lorentz metric g;;,. In fact, by Lemma 8.1 it suffices to show that if d* are given 
by (8.5), then A“| , = 0 and (0/Ayo)A‘| , = 0. Note that (8.22), together with 


(8.15), implies that A‘ | g = 0 when ¢ is determined by any metric gjx Satisfying 


(8.26) Ijk — jk = O(y6)- 
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Thus we have ae = 0 in our case. Next, by (8.25) and Lemma 8.2, ere = 0, 
so (8.17) implies (0/Ay0)A"| , = 0. 

Since (yo,...,Yy3) are harmonic coordinates for the metric gj, we have (8.1) 
as a consequence of (8.9). Converting back to (9, ..., 23)-coordinates, we also 
have (8.2), as a consequence of (8.26). This completes the proof. 


For simplicity, we have not specified which Sobolev spaces are needed for 
the initial data. Due to the special structure of Einstein’s equations, one can 
obtain solutions with less regularity than is needed for general second-order, 
quasi-linear hyperbolic systems. Results on this can be found in [HKM] and in 
Chap. 5 of [Tay]. 

We have the following local uniqueness result: 


Proposition 8.4. Suppose gj, and 9; , are two smooth solutions to (8.1)-(8.2), on 
a neighborhood of S. Then there exists a C~-diffeomorphism yp on a neighbor- 
hood of S, such that 9 = id. and y*g = g’. 


Proof. Without loss of generality, one can assume that the coordinates 
(xo,..-,%3) are harmonic for the metric g;,. Parallel to (8.22)-(8.23), solve 
‘y; = 0, Uy = Xj, dy,| g = dz,;, where L1’ is the Laplace—Beltrami operator 
for the metric g’. Then the diffeomorphism y(y) = x does the trick, since the 
system (8.9) for g (in the x-coordinates) is precisely the same as the system for 
g’ (in the y-coordinates), and solutions to this quasi-linear hyperbolic system are 
locally unique. 


We have seen that one way to “hyperbolicize” the equation (8.1) is to use 
harmonic coordinates. We now discuss an alternative method, due to D. DeTurck 
[DeT]. In this method, (8.1) is modified to 


(8.27) Ric(g) — div* (W~' div 6(W)) =0, 


where W is a convenient second-order symmetric tensor field, which we will 
specify below, and 6 acts linearly on S?T*, by the rule 


1 
(Tr W) 93x. 


(8.28) BW) jn = Wie — 5 


In fact, if the initial data for g;;, are given by (8.2), we set 


(8.29) Waal) = Gyx(x) = Gjx(2") + eokju(2’), 


as in (8.21). Note that, upon lowering an index, W defines an invertible endomor- 
phism on the tangent bundle T; we denote the inverse by W~t. 

For any given invertible W, B = div*(W—! div 6(W)) depends on the metric 
tensor gj,; a calculation shows that it is given by 
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(8.30) Bix = 57" [—9; OxGem + Om Ok Ge; + 90; 9mk] + Cor (9, VQ). 
Comparison with (8.3) gives 

. Lae 
(8.31) Ricj, — Bj, = oo OOm9jk + Mjr(g, V9) — Ce (g, V9): 
Thus the equation (8.27) is strictly hyperbolic. Again, the results of Chap. 16, § 3, 
apply. We now want to show that if the initial data (8.2) satisfy the compatibility 
conditions of Proposition 8.3, then B,;, = 0, so again we get a solution to (8.1). 
More precisely, we establish the vanishing of 
(8.32) u=W' div 6(W). 
In fact, applying div o © to both sides of (8.27) and using Gyn” = 0, we have 
(8.33) div 6 div*u = 0, 
when g satisfies (8.27). Note that, for any covariant vector field v, 


: i 1 
(div 6 div"v); = 59" (Cemsi — Uj;6m — Ve;j3m) 


1 e 
_ aig 


(8.34) 
Us bem = Ric! jv), 


so (8.33) is a strictly hyperbolic equation for u. Now, the construction (8.29) of 
W guarantees that div 6(W) = 0 on S = {xo = 0}, so 


(8.35) ti| =U, 


Thus, the vanishing of u would follow from ou g = 0. To get this, we use a 
lemma: 


Lemma 8.5. [fv is a covariant vector field on a Lorentz manifold (M, g), then 
(8.36) v| g =0, O(div*v)?; |, = 0 => dov|, = 0. 
Proof. We have 


G(div*v) 5 = 5 (vj? + v9.5 — 059°" Vem): 


N| rR 


Hence 


1 
vg =0=> G(div*v)°; = 59° O00 on S, for j =1,2,3. 
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In particular, the hypotheses in (8.33) yield that Aov5| = 0, for 7 = 1, 2,3. 


Granted that, we see that, on S, 9!" ve.m = g°° v0.0, 80 


1 
G(div"v)"o| = 59° Aor on S, 


and this yields the complete implication (8.36). 


Now, the compatibility conditions (8.20) on the initial data imply that 
Rig’; | = 0, so if (8.27) holds, then u = W~! div G(W) satisfies all the 
hypotheses of (8.36). Thus ou s = 0, so we have the following result: 


Proposition 8.6. [f the initial data satisfy the compatibility conditions (8.20), 
then the solution to the hyperbolic system (8.27) is also a solution to (8.1). 


Having examined the empty-space case, we next consider Einstein’s equations 
coupled with Maxwell’s equations for an electromagnetic field, (5.1). In parallel 
with the approach to the empty-space case in (8.9), we will consider the system 


Ls ge, ; 
(8.37) — 57g! O0m9jk + Ayn (g, V9) = san (Typ = 5T9ik) 
F =0, 


where, in the first equation, T = Ti j and 


1 1 . 
es (FF | uF gin) 


: Lie 
(8.38) ik = 7 


is the stress-energy tensor for the electromagnetic field, as in (5.2). We have 
obtained the second equation in (8.37) from dF = 0 = d*F, by using 0 = 
—dd* — d*d. In local coordinates, this operator depends on gj, and derivatives 
up to second order; in fact, 


(8.39) (OF) 5% = 9°” O0OmF jx + Ejx(D?g, VF). 


Let us pose initial data on a compact hypersurface S, including the data (8.2), 
with 


(8.40) jn © He*1(S), kjx € H°(S). 
We assume that S is spacelike for these data and that s > 7/2. We also specify 


(8.41) Fyrlg € H°(S), OF ;%|, € H°-*(S). 


lg 


We will postpone placing compatibility conditions on these data. If we identify a 
neighborhood of S with I x S, I = (—a,a), then, for a sufficiently small, we 
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will obtain a solution to (8.37)-(8.41) satisfying 


9 € C(I, H**"(S)) NC’, H°(S)), 


=) FECLEA (SE) 1c CA). 

The local existence of solutions to (8.37) is established by a slight variant of the 
method developed in §§ 1-3 of Chap. 16 to treat hyperbolic systems. One obtains 
first-order systems for (y, =), with yp = (Agjx, Oogjr) and = (AF jx, OoF jx). 
From there one solves approximating systems for (y-, ¢-) and uses energy esti- 
mates plus Gronwall’s inequality to establish 


(8.43) Ilee(t) lire + [We lF2-1 < C, 


for |t| < a. We require that s > 7/2, so H*(S) C C?*"(S) for some r > 0. 
From (8.43) it follows that a limit point exists, yielding a solution to (8.37)— 
(8.41). As in Chap. 16, one establishes uniqueness, and the continuity described 
in (8.42). The reason one uses different Sobolev estimates for y(t) and for y(t) 
is the occurence of second-order derivatives of gj, in (8.39), compensated by the 
fact that no derivatives of F;;, are involved in the first equation of (8.37). 

Now, assume that F | g and OoF | g Satisfy the compatibility conditions 


(8.44) dF =0=d*F on S. 


I 
a 


and Oid* = d*O, we have 


Since Od 


(8.45) (dF) =0, O(d*F) =0. 


We deduce that dF = 0 = d*F on I x S. As discussed in § 1, this implies that 
Tyr, given by (8.38), satisfies 


(8.46) TIF, =0. 


’ 


We next want to show that if 95x| g and dog;x| g Satisfy appropriate com- 


patibility conditions, then 4“ = —Olzy = 0, so in fact the Einstein equations 
Gy = 87KT)}, follow from (8.37). 


Lemma 8.7. Assume that we have a solution to (8.37)(8.41) on I x S' and that 


(8.46) holds. Assume that Cale = 0, for 0 < € < 3, and that, forO < k <3, 


(8.47) De (Gies Dad sn kik Vokje) = 8TKT x. 


Then Y =O0onI x S. 
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Proof. If (8.39) holds, then (8.13) holds, with Tik = 8rKTjx. In fact, (8.39) 
implies that Ro = —87 KT, 80 


~ ~ 1 1 
Tyk = Ryjg + ATK GR = 8 (Tip = 37 95% + 579i) = 81K Typ. 


Now, a computation parallel to that yielding (8.17) shows that in the present case, 
(8.48) | 5 = 0 => 248], = ge200A"| 5 + 2T°x- 

Hence, if (8.47) holds, the hypothesis \’ = 0 on 9 yields 

(8.49) MeO; Gx 2g 

Also, by (8.46), we have TI, = 0, so (8.13) gives 

(8.50) O,0" + BY (g, Vg)erA™ = 0, 


and the initial-value problem (8.49)—(8.50) has only \’ = 0 as a solution, so the 
lemma is proved. 


From here, one obtains the following parallel to Proposition 8.3: 


Proposition 8.8. Suppose the initial data in (8.40)-(8.41) satisfy the consistency 
conditions (8.44) and (8.47). Then there is a solution to 


Gyr = 8TK Tk 


satisfying these initial conditions, where T;;, is the electromagnetic stress-energy 
tensor, given by (8.38). 


We next consider Einstein’s equations coupled with the equations of fluid 
motion. We use the form (6.59) of these equations, namely, 


(8.51) Vw(O — 28'V2, ,,)w@ — B(D°g, D?w, VQ) = 0 
and 
(8.52) £,0=0, 


As in (8.37), we write Einstein’s equations as 


1 1 
(8.53) 5.9" OOmgjx + Hjx(g, V9) = 81K ( Tix — 57954); 


where 7 = T! ; and this time 
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(8.54) Tir = (p+ p)ujur + Pg jk 


is the stress-energy tensor for a fluid with 4-velocity u, density p, and pressure p. 
As in (6.37), 


dp 


(8.55) w=elu, dq=——, 
pp 


and w is the 1-form associated with w. Since we want (8.51)—(8.53) to be a system 
of equations for (w,Q, g), let us rewrite (8.54) as 


prp 


B56) Tre =— Tey wiwk + Pose, P= p(—(w,w)), p=p(p). 


The formula for p as a function of (w,w) follows implicitly from (8.55), since 
e74 = —(w,w). 
We have made a slight notational change from (6.59) to (8.51), recording 


the dependence of B on D%g. Clearly, the coefficients of the operator V,,(C — 
26'V%, ,,) also depend on D*g. Recall that 


p'(p) ~1 


a 
(8.57) iawn 


Also, as long as the equation of state satisfies p’(p) > 1, (8.51) is strictly hyper- 
bolic, and any hypersurface that is spacelike for the metric gj, is also spacelike 
for (8.51). 

Let us pose initial data on a compact hypersurface S, including the data (8.2), 
with 


(8.58) Son € H°+2(S), jn € +15). 

We assume that S is spacelike for these data and that £ > 7/2. We also specify 
(8.59) w;|,€ HS), Aw;|,€H"(S), Sw;|,€ 4° (S), 
and 


(8.60) O;%|, € H*(S). 


Of course, there will be compatibility conditions that we will ultimately want to 
place on such data, as discussed below, but these are not needed for the solvability 
of (8.51)-(8.53). We will assume that w| gs is timelike: 


(8.61) (w, w)| g < —Co < 0. 
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We identify a neighborhood of S with I x S, I = (—e,¢). Using the methods 
of Chap. 16, in a fashion parallel to our discussion of (8.37)-(8.41), we obtain a 
solution to (8.51)-(8.53) satisfying (8.58)—-(8.60) and 

g € C(I, H***(S)) nC (1, H**1(S)), 
(8.62) w € C(I, H**1(S)) nO? (1, H*"(S)), 
2 € C(I, H"(S)). 


We leave to the reader the demonstration that appropriate consistency condi- 
tions on the initial data then imply that the Euler equations (6.4) are satisfied. 
This in turn implies T/*.;, = 0, and hence Lemma 8.7 is applicable. From there 
one proceeds as before to show that when the initial data satisfy the consistency 
conditions, one has a solution to (6.1)—(6.2). 

As in the case of nonrelativistic (compressible) fluids, one also considers the 
phenomenon of shock waves in relativistic fluids. For some work on this, see 
[Lich3, Lich4, ST, ST2, Tau2]. 


Exercises 
1. Write out the principal symbol L(x, €) of the operator £ in (8.3), and verify that 
det L(x, €) = 0, 


for all € € R*\ 0. 

2. Show that under appropriate consistency conditions on the initial data, solutions to 
(8.51)-(8.53) also satisfy the Euler equations (6.4). (Hint: For one approach, see 
(CBr3].) 

3. Discuss the appropriate initial-value problem for the relativistic motion of a charged 
fluid, coupled both to the metric of spacetime and to an electromagnetic field. The 
resulting equations are the equations of relativistic magnetohydrodynamics. 

Material on this can be found in [Lich3]. 

4. Make use of finite propagation speed to eliminate the hypothesis that the initial surface 

S be compact, in the results of this section. 
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It is of interest to consider further when the initial data (8.2) satisfy the consistency 
condition (8.20). When 9; , is restricted to T'S, we get a Riemannian metric on 
S; hip = 9; ,» for 1 < 7 < 3. Let us assume for now that the coordinate system 
(ap,..-, 23) has not only the property that S = {a9 = 0} but also the property 


that the vector field 0/0x9 is orthogonal to S. Thus Gor =Oforl<k< 
3. Suppose also that 0/Oxzp = N is a unit vector on S, (N,N) = —1 (ie., 


Soi = —1). Using the Gauss—Codazzi equations, we can express G°;, | g in terms 
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of the metric tensor hj, of S and the second fundamental form of S C M, which 


we denote as /X;;. Denote the associated Weingarten map by A : T,S — TS. 
The Gauss equation, (4.14) of Appendix C, implies that, for X tangent to S, 


Ricyy(X,X) — Rics(X,X) 


3 
(9.1) = (RM(N,X)X,N) + 5°((A?A)(E; A X),X A E;), 


j=l 
where {£, £2, £3} is an orthonormal basis of T,,S. From this, we have 

(9.2) Su — Sg = —2 Tr A?A — 2 Ricy;(N, N), 

where Sq, is the scalar curvature of M, and Sg the scalar curvature of S. Compare 
with (4.72) of Appendix C. There is a sign difference, since here (N, N) = —1. 
Since Goo = Ricoo — (1/2)Sargoo, we have 


Su = Ss =-—2Tr A?A = 2G 0 = S900, 


or equivalently, 
0 1 2 
(9.3) Gols = 55s + TrA?A. 
Meanwhile, the Codazzi equation, (4.16) of Appendix C, implies 


(9.4) Kyrie = Ker.j — Reoje. 


We define the mean curvature H of S C M to be H = (1/3)Tr A = (1/3) K9;. 
The identity (9.4) implies K;* 6 = Kyt.; - Re ose, and hence 


(9.5) K;* 4 = 3H; + Ricoj. 
Since G°; = Ric®; — (1/2)Sig°; = Ric®,, for 1 < j < 3, we have 
(9.6) Qae= hy esha Tapss. 


We have the following result: 


Proposition 9.1. [f S is a Riemannian 3-manifold with metric tensor hj, and 
Kj, is a smooth section of 8*T*(S), then there exists a Lorentz 4-manifold M 
that is Ricci flat (i.e., satisfies (8.1)), for which S is a spacelike hypersurface, with 
induced metric tensor hj, and second fundamental form K jx, if and only if 


(9.7) Ss — Kj, K* + KI; K*, =0 
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and 
k _ 
(9.8) Kj", — 3H; = 0. 


Proof. We have just seen the necessity of (9.7) and (9.8). For the converse, if hjx 
and Kj, are given, satisfying (9.7) and (9.8), set 


(9.9) Gjn = hye £155,453, Go =—1, Gj, =0, otherwise. 


Also, set 


O° 


(9.10) kjr = —2K jx if 1 < j,k <3, kj, =0, otherwise. 


Note that, for any metric gj, satisfying (8.2) with these initial data, hj, and Kj, 
do specify the first and second fundamental forms of S = {ao = 0}. See (4.69) 


of Appendix C. The fact that this prescription yields 9; ;, and hj,, which satisfy 
the compatibility conditions of Proposition 8.3, thus follows from (9.4) and (9.6). 


The index raising and covariant differentiation performed in (9.7) and (9.8) are 
operations defined by the metric tensor hj, on S'. Note that (9.7) follows from 
(9.3), via the identity 


Tr A?A = —(Tr A)? — ; ira, 


1 
2 
which is readily verified by using a basis that diagonalizes A. In the physics liter- 
ature, (9.7) is called the Hamiltonian constraint and (9.8) is called the momentum 
constraint. Together, (9.7) and (9.8) are called the constraint equations. 

Note that special hypotheses about the coordinates used on M disappear in the 
formulation of Proposition 9.1, which is convenient. 

We can define the trace-free part of Kj,: 


(9.11) Qin = Kyx — Hhjr. 
Then the system (9.7)-(9.8) becomes 


(9.12) Sg — QjnQ* + 6H? = 0, 
(9.13) Q;*.4 — 2H.; = 0. 


This system has been studied in [Lich1, Yol, OY1,CBr5, CBr6, CBY]. Follow- 
ing their work, we will investigate this system more closely in the special case 
when H is taken to be constant on S, which we now assume to be compact. Then 
(9.13) specifies that @;;, is a divergence-free (trace-free, order 2, symmetric) ten- 
sor field. We show how to construct all such fields on a compact Riemannian 
manifold (.S, h). 
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Let us define Dp on vector fields by 
1 
(9.14) DrprX = Def X — 7 div X)h, Drr:C°(S,T) > C™(S, aot), 


where Def is the deformation operator, (Def X) jx = (Xj.4+Xx;;)/2, and where 
Ae T* denotes the bundle of second-order, trace-free, covariant tensors. A calcu- 
lation yields 


(9.15) DS —div| cone 
Hence 
- ‘ 1 : 
(9.16) DrppDrrX = —div Def X + — grad div X. 
n 


The operator £ = D7.,DP rr is a second-order, elliptic, self-adjoint operator, and 
there is a Weitzenbock formula, which implies 


(9.17) |DrrX|[22 = 5 IVX [32 + (5 ——lldiv X|[Z2 — 5 (Ries(X), X) p. 
Compare with formulas (4.26)-(4.31) of Chap. 10. 

The kernel of £, which is equal to ker Dr p, consists of conformal Killing 
fields, as noted in (3.39) of Chap.2. It is a finite-dimensional subspace of 
C™(S,T). Let E be the inverse of £ on the orthogonal complement of ker 
£, and zero on ker CL. It follows that 


(9.18) E> H*(8,T) —> H***(S,1), 


for all s > —1, by the elliptic regularity results in Chap. 5, § 1, and more generally 
for all s € R, by the construction of E € OP.S~?(S) in Chap. 7. Now set 


(9.19) Pi =DppE Dip, Pi: H°(S, 5827") 4 H°(S, S27"), 


The operator P, is the orthogonal projection onto the range of Dp p in L?, that 
is, the image of Dr p acting on H'(S,T), and Py = I — P, is the orthogonal 
projection onto the kernel of Dj.,, in L?(.9, S37*). From (9.19) it follows that 


(9.20) Py: C®(S, S2T*) —+ C™(S, S27"). 


The set of smooth, divergence-free, trace-free, second-order, symmetric tensor 
fields Qj, is precisely the image of Pp in (9.20). 

Now, if we have a Riemannian metric hj; on S and a solution Qj, to (9.13), 
with H constant, the scalar curvature may not satisfy (9.12). In [Yol] and [OY 1], 
following Lichnerowicz, who treated the case H = 0, it was shown that the triple 
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hr, H, Qj) leads to a new triple (h;,, H, Q;;,), where h;, is a conformal mul- 
(hjx, H, Q; p jks 41, 5h j 
tiple of hx: 


(9.21) hee =O hap, 


#7 is unchanged, and Q; ; 18 asmooth multiple of @;;, involving a different power 
of vy, as we will see below. Then (9.12)—(9.13) hold for this new triple, provided 
(py satisfies a certain elliptic PDE, which we proceed to derive. 

For these calculations, denote covariant differentiation associated to h and h 
by V and V, respectively. Then 


= vik jk aj + atk ak aji 
(9.22) Vi =ViQ’ +n —M)Q + (Pou —D* x) Q”, 


where the connection coefficients are related by 
nt i 2 f 93 ‘i i 
(9.23) Tin = Tn + 5 (55 Oke + 6 Oi — hyn dy). 
Consequently, for any symmetric, second-order tensor field QQ", 
= aijk jk l0jk Qos 
Vi? =ViQr + ral One — ook Oy. 


The last term vanishes if Q,,” = 0. If, furthermore, o” = ¥Qs*, then 


aig ” 10 . 
(9.24) Vid" = vv.Q" + (aiv+ Pay) or. 
The last term here vanishes if 7) = y~ 1°, so we have 

Aik 10 7k ~~ yA * _ .-10 jk 
(9.25) Q =p Qh = Vid =~ OVEQ 


when @,, is symmetric and has trace zero. Note that (9.25) implies Qik = 
Qik. 

Thus, if Qj, is constructed as above, as an element in the range of Po, so it 
has trace zero and solves VQ," = 0, we also have VeQ;* = 0 whenever hx, is 


related to hj, by (9.21). The scalar curvatures of (5h) and (S, h) are related by 
(9.26) 5s = "Ss — 8p “Ag, 


where A is the Laplace operator on (5, h). Now we want to satisfy the analogue 
of (9.12), namely, 
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_ — xjk _ a = 
(9.27) Ss =Qj,.Q — 6H? = 9 Qj,.Q"* — 6H? = pf — 6H’. 


By (9.26), we want ~ to be a smooth, positive solution to the PDE 


1 1, _ 3 
(9.28) Ap = sSse— gle "+ GH’e = Fla.) 
Equations of this form are discussed in § 1 of Chap. 14. 
If f > 0 on S and H F 0, then Theorem 1.10 of Chap. 14 implies the solvabil- 
ity of (9.28), which is a special case of (1.50) of Chap. 14. We thus have 


Proposition 9.2. Let S be a compact, connected 3-manifold, with metric tensor 
hjr, Qjx a smooth, divergence-free, trace-free section of S?T* (S), and let H be 
a nonzero constant. Then there exists a positive p € C™(S) such that if 


(9.29) hg=@ he, On= oO One 


then there is a Ricci-flat Lorentz manifold M, containing S' as a spacelike hyper- 
surface, with induced metric hj, and second fundamental form 


(9.30) K jn = Q5, + Ahyk, 
provided Qj QJI* = f is not identically zero on S. 


Proof. The argument above gives the result provided f(x) > 0 on S. It remains 
to weaken this condition on f. First, if the scalar curvature S'g of (5, h) is negative 
on {x € S: f(x) = 0} = &, then Theorem 1.10 of Chap. 14 still implies the 
solvability of (9.28). On the other hand, we can make a preliminary conformal 
deformation of the metric tensor of S to make its scalar curvature negative on any 
proper closed subset of 5’, such as /, as long as © is not all of S, so Proposition 
9.2 is proved. 


If Q jx is taken to be zero, then (9.28) becomes 


1 3 
(9.31) Ay = 35s? - ge. 


Integrating both sides, we see that if there is a positive solution y, then 
[ ss(@e) dV(x) <0. 
Ss 


In particular, (9.31) has no positive solution if Ss > 0 on S. Here is a positive 
result: 


9. Geometry of initial surfaces 733 


Proposition 9.3. Let S be a compact, connected 3-manifold with metric tensor 
hjx. Assume the scalar curvature of (S,h) satisfies Ss(x) < 0 on S. Let H be 
a nonzero constant. Then there is a positive p € C°(S) such that if hj, = 
pthyr, then there is a Ricci-flat Lorentz manifold M, containing S as a spacelike 
hypersurface, with induced metric h; x and second fundamental form 


(9.32) K jn = Hix. 


Proof. The equation (9.31) has the same form as (1.49) in Chap. 14, as the equa- 
tion for the conformal factor needed to alter (S,h), with scalar curvature Sg, to 
(S,h), with scalar curvature Sy = —3H?. Thus solvability follows from Propo- 
sition 1.11 of Chap. 14. 


If H = 0, then (9.28) becomes 
1 1. _? 
(9.33) Ay= 5S5sp— fe: 


where we recall that f = Qj,Q/*. The solvability of (9.33) implies the identity 
JgSse dV = [5 fy~' dV, so there is no positive solution if Ss < 0 on S. On 
the other hand, Theorem 1.10 of Chap. 14 applies if Sis(a) > 0 on S and f > 0 
on S, so we have the following: 


Proposition 9.4. Let S be a compact, connected 3-manifold with metric tensor 
hyn. Assume the scalar curvature of (S,h) satisfies Ss(x) > 0 on S. Let Q;x be 
a smooth, divergence-free, trace-free section of S*T*(S). Then there is a positive 
yp € C®(S) such that if hj and Qjx are given by (9.29), then there is a Ricci- 
flat Lorentz manifold M, containing S as a spacelike hypersurface, with induced 
metric hj x and second fundamental form 


(9.34) Kia Oy 


provided Q jx, is nowhere vanishing. In such a case, the mean curvature of S C M 
vanishes. 


Next, following [CBIM], we extend Proposition 9.3 to allow H to be non- 
constant, provided it does not vary too much. As before, we have a compact, 
3-manifold S, with Riemannian metric tensor h, scalar curvature Sg. Let H 
be a smooth, real-valued function on S. We want to construct a positive yp € 
C™(S') and a second-order, symmetric, trace-free tensor field Qj, on S' (ie., 
Q € C™(S, S2T*)) such that if we change the metric tensor to hik = p*hin, and 
set Q; k= pe ?Q; k» as in (9.29), then S is a spacelike hypersurface of a Ricci-flat 
Lorentz 4-manifold, with induced metric h; , and with second fundamental form 
K jn = Qj, + Hhjx, as in (9.30). 

In order to achieve this, we need to satisfy the Gauss—Codazzi system 
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Ss—-Qy,Q +6H? =0, 


(9.35) Bui _ 
VQ," — 2V;H =0, 


which one converts to 


1 1 5 
Ay = =Ssy — =|Q/?—"' + —H’ yp? 
p= 2Sse— ZlQl'e +7 


ViQh — 2p%h* Hy, = 0, 


(9.36) 


where |Q|? = Qin Q?*. As before (and as noted in [Yo3] and [CBY]), we get from 
(9.35) to (9.36) via the identities (9.25) and (9.26). The first equation in (9.36) is 
the same as (9.28). 

The second equation in (9.36) is 


(9.37) Di. pQ + 2y° grad H = 0. 
We look for Q in the form 
(9.38) Q=DrrX+Q’, DrrQ’ =0, 


where X is a vector field on S. As in (9.14), Drr-X = Def X — (1/3) (div X)h. 
Pick Q° € R( Po). Then (9.37) becomes 


(9.39) LX = —2y° grad H, 
where 
1 
(9.40) LX = DppDrrX = —div Def X + 3 grad div X. 


Assume that £ is invertible, that is, (S,) has no conformal Killing fields. Thus, 
given y, we solve (9.39) for X, X = —2£~'(y® grad H). Then it remains to 
solve for vy: 


1 il 3 
(9.41) Ap = 5Sse— sIE(e) + Q"l'p "+ 5H”, 
where 
(9.42) Eu=-—2DprpoLl7'(ugrad HH), EC OPS™'. 


This has a slightly more complicated form than (9.28), though of course it reduces 
to (9.28) when A is constant. 
We will establish the following: 
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Proposition 9.5. Let S be a compact 3-manifold with metric tensor h. Assume 
the scalar curvature Sg < 0 on S. Let H € C™(S) be a given positive function. 
Then we can find a positive p € C®(S), solving (9.41), with Q° = 0, provided 


(9.43) |DrrL~*ulln~ < Aollullz, 
with 
(9.44) Ao||VH lL < V3H nin: 


As a preparation to proving this, we first obtain some a priori estimates for a 
positive solution y € C'™(S) to (9.41), with Q° = 0, that is, to 


(9.45) Ay = f(x, E(¢*), 9), 
where 

6 1 1 6\|2,.—-7 3 2D 
(9.46) f(z, E(v"), 9) = 35s" — glE Pel + 5H’ e. 


Note that if y(2%0) = Ymin > 0, then Ay(2o) > 0, so if (9.45) holds, then 
f (xo, E(y*), v(xo)) > 0. Hence 


3 1 1 
(9.47) 5 4(x0)*p(a0)” > g(-Ss(20)) e(w0) 2 grv(ao) 
if Sg < —o < 0onS. This implies 
4_ 1 -2 
(9.48) pa on S; ag= 79 7 Himax: 


We next derive an upper bound on a solution to (9.48), using the hypothesis 
(9.43)-(9.44). Suppose (1) = Ymax- Then Ay(21) < 0, so if (9.45) holds, then 
f (x1, E(y®), o(21)) < 0. Now 


f(x, E(¢*), y(21)) 
(9.49) 3 1 1 
= Yin | 5H (01) Phas — ZIE(P®)? + 285 (01) Pha 
2 8 8 
The hypothesis (9.43)—(9.44) implies 
(9.50) |E(P)I|z~ S 2Aol|VA||z=|Ie°llz~ < 2V3HminPmax: 


sO 
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3 1 
5A (@1)°Pimax - cael > 1Pnaxs 


(9.51) 1 
n= 5 (8H 2a — ABIIVH ix) > 0, 
and hence 
6 5 1 
(9.52) f (x1, E(y ), p(21)) 2 1Pmax + 35s(11)Pmax- 


If the left side of (9.52) is < 0, this requires np>,,, + (1/8) S's(21) max < 0. Thus, 
if |S's| < -y, a solution to (9.45) must satisfy 


4 
; < (=o = us 
(9.53) PSMON Py Oa SH Vales 


min 


For the rest of the discussion, we modify the formula (9.46), replacing it by 


d 1 3 
(0.54) f(#, B(v"), ») = 3 Ssuly) — gIB)Ialy) + 5H’ ¢", 
where 
=, = ao, 
0.55) Lp) me yp > ao 
a(y) =Y ’ yp = ao. 


In addition, we require the functions j: and a to be smooth and monotonic on R, 
for a to be linear on y < 0, for u(y) > vy on R, and for p(y) to be some positive 
constant (say j19) for p < ao/2. 

To prove Proposition 9.5, we will use the Leray—Schauder fixed-point theorem, 
in the following form (cf. Theorem B.5 in Chap. 14): 


Theorem. Let V be a Banach space, F : [0,1] x V + V a continuous, compact 
map such that F(0,v) = vo, independent of v. Suppose there exists M < co such 
that, for all (r,x) € [0,1] x V, 

(9.56) F(7t,2) =x => |a|| < M. 


Then F: V > V, given by F(a) = F(1, 2x), has a fixed point. 
We will apply this to V = C(S') and 


(9.57) F(r,~) = (A-1)7'(%-(y) - ¢), 


where, picking b = (ao + a1)/2, we set 


(9.58) V-(y) = (1—7)(p — b) + rf (a, E(y®), 9), 
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with f as in (9.54). Note that 
(9.59) F(0,¢) =—(A—-1)"b=8. 
Also, 
(9.60) F(, 9) =p = Ap=rf(a, E(y’),¢) + —7)(v - 2). 


To check (9.56), we need to estimate Ymax and Ymin Whenever 7 € [0,1] and 
y satisfies (9.60). The case 7 = 0 is clear, so we may assume 7 > 0. If y(ao) = 
min, then V(y) > 0 at 2, so 


(9.61) Tf (xo, B(y*), (20) + (1 — )(y(20) — b) > 0. 


If y(xo) < band € (0, 1], this requires f(xo, E(y*), y(a0)) > 0, or (in place 
of (9.47)) 


3 1 
(9.62) 5/1(©0) nin 2 3 (—Ss (a0) H(Pmin)- 


This forces p?,, > (0/12) H52u(Pmin), Which in turn forces ~min > 0. Since 
u(y) > yand —S‘g > 0, this implies (9.47). Therefore, again we get the estimate 
(9.48). 


If (x1) = Ymax, then V,(~) < 0 at x1, so 
(9.63) Tf (x1, E(y®), p(ai1)) + (1 — 7) (y(a1) — 6) <0. 


If y(z1) > band 7 € (0, 1], this requires f(a, E(y®), p(a1)) < 0, which is 
equal to (9.46) if Gmax > b, so as before we have the estimate (9.53). 

Thus the fixed-point theorem applies to our situation, so Proposition 9.5 is 
proved. Thus, in rough parallel with Proposition 9.3, we have the following: 


Proposition 9.6. Let S be a compact 3-manifold with metric tensor hj;,. Assume 
the scalar curvature Ss < 0 on S. Let H € C™(S)) be a positive function satis- 
fying the hypothesis (9.43)-(9.44). Then there is a positive p © C™(S) such that 
if hix = p*hjk there is a Ricci-flat Lorentz 4-manifold M, containing S as a 
spacelike hypersurface, with induced metric hj, and second fundamental form 


(9.64) K jn = Q5, + Ahyk, 
where 
(9.65) Q = -2y7? -Drpl'(y®VH). 


In particular, the mean curvature of S C M is H. 
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See [CBY] for a discussion of some cases where S is diffeomorphic to R°, 
and asymptotically flat, and [CO] for a further analysis, when also H = 0. In this 
case, one can make a preliminary conformal change of metric to achieve Siz = 0, 
so that (9.33) becomes 


1 
Ay = —-=fy". 
~ tha 


Exercises 


1. Show that the identities (9.3) and (9.6) for G°,| s imply Lemma 8.2. 

2. Put together Proposition 9.4 and Birkhoff’s theorem, and make some deductions, 
regarding (Qj) and solutions to (9.33), and symmetry properties that they cannot have. 

3. Suppose that one wants to solve (1.1), namely, 


(9.66) Cie ae 
Show that the equation (9.7)-(9.8) on S' get replaced by 


Ss — K;,K?* + K9,K*), = 2p, 


(9.67) k 
kK; ik 3H; = Jj, 
where 
(9.68) p=8nToo| 5, Jy = —8xTo,| 5 


Study this system, particularly in the case H = const. 
4. Extend Proposition 9.5 as follows. Replace (9.43)-(9.44) by the hypothesis that, for 
some p > 3, 


(9.69) |DrrL‘ulltoo < Apllullze, 
with 
(9.70) Ap||VHA||z2 < V3 Hin. 


5. Note that solving the constraint equations (9.7)-(9.8) with Kj, = 0 is reduced to 
solving (9.33) with f = 0, namely, 


1 
Ag = <Ssy. 
pe 


Consider solutions to this, both on compact and on noncompact S.. Relate the solution 


M 
y(rz) =1+ — 
(x) iz] 


on flat R? (outside the origin) to the Schwarzschild metric. 
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10. Time slices and their evolution 


Suppose MM is a Lorentz manifold whose metric satisfies the empty-space Einstein 
equation (8.1), and on MM we have a smooth function ¢ such that tT = grad t¢ is 
timelike. Thus M is foliated by the surfaces S., on which t = c, called time 
slices. One can choose local coordinates (t,x1, 22,23) on an open set O Cc M 
with respect to which the metric is 


3 
(10.1) ds? = —X(t, x)? dt? + S© gjx(t, x) dx; dag. 
j,k=1 


This can be done by picking local coordinates (x1, x2, #3) arbitrarily on one slice 
S, and then taking x; to be constant on each integral curve of 7 through such a 
coordinate patch. The function 


(10.2) Mt, 2) = [-(r,7)] 7 


is called the lapse function of this foliation. Note that (g;,(c,x)) defines the Rie- 
mannian metric induced on S, and that N = —\r = A~1 0/0t is a unit timelike 
normal to S,. 

Each S, has a second fundamental form K jz, (c, x), and, by Proposition 9.1, for 
each c, Kj, = K;,,(c) must satisfy the constraint equations (9.7)-(9.8), where 
the covariant derivatives are given by the Riemannian metric on S,. Note that 


95k 


(10.3) DE 


= —2X0K jx. 
The following identity complements the constraint equation: 


Proposition 10.1. [f the Einstein equation (8.1) holds, then 


OK jx 
at 


(10.4) = —Yajse + A(Rich, + 83H K jn — 2K cK ";). 


Proof. Calculating the components RM. 0 Of the Riemann tensor of M, in coor- 
dinates (,..., 23), with xp = t, one obtains, for 1 < j,k < 3, 


(10.5) rN? Riono = AW Aaja + AT! OK Gk + KG eK. 
Now (9.1) implies 
(10.6) A? Riko = Rich, — Rici, + 3HK jx — KjeK'x, 


so, for any metric of the form (10.1), we have 
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OK jk : .M 
(10.7) a = ajar + A(Ric}, + 83H K jx — 2K jeK"),) — ARicjp. 
This proves (10.4). 


Note that 0,(Kj") = g**(O,K jc) + (Or.g**) K je. Using (10.3)-(10.4), we have 


OK;* sk Sk k 
(10.8) a —d,3" + A(Ric7” +3HK;"). 
Taking the trace yields 
ial 
(10.9) 3a = —Ad+A(S5 + 9H”). 


The importance of the evolution equations (10.3)-(10.4) is highlighted by the 
following result: 


Proposition 10.2. If the evolution equations (10.3)-(10.4) hold for0 <t < T 
and the constraint equations (9.7)-(9.8) hold at t = 0, then the Einstein equation 
(8.1) holds for t € [0,1] (and hence so do the constraint equations). 


Proof. To begin, from (10.7) we see that if (10.4) holds, then Ric}; = 0, for 
t € [0,7], 1 < j,k < 3. Hence, in view of the form of the metric (10.1), 


(10.10) Ric;* =0, for 0<t<T,1<j,k <3. 


From now on we drop the M/ from Ric™ . It remains to show that Rico” = 0 for 
0<t<T,0<k<3. 

We will obtain a first order 4 x 4 system for Rico”, making use of the Ricci 
identity 


(10.11) Ric;*.. = aoe 

which gives 

(10.12) Ric;°,9 = —Ric;*,1 — Ric;?,2 — Ric;?,3 + 5 uy 
By (10.10) we have 

(10.13) Su =Rico®, Si.j = Oj;Rico®. 

Now, if I’ jk are the connection coefficients of M/, we have 


(10.14) Ric;*,¢ = ORic;* + I*¥ peRic;” — I" jeRicm”*. 
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Thus, again by (10.10), 
(10.15) 1< j,k <3 => Ric,*,, =D *gRic,;® — T° ;pRico*. 
Hence, for1 <j < 3, 


3 
1 ae . 
(10.16) Ric;°.9 = 5 0jRico® = S (T*o,Ric;° — T°;,Rico*), 
k=1 


and we can replace the left side of (10.16) by 
(10.17) Ric;°,9 = Rico? +T°moRic;” —I'™ joRicm®. 


It is convenient to rewrite the terms on the right side of (10.12) when 7 = 0, 
using 


(10.18) Rig? =o" Rit, = op Rita — og “Rica, 
We obtain Ric;’., = g/'g"Ric,,'.,, and hence 
(10.19) . . 
Rico’, = —A 29°" O,Ricn® — A729?" 1 gRicn’ +A 2g?" T mgRice 


Consequently, we obtain a 4 x 4 system of the form 


3 
Rico? = 2? S° gh” J,Ricm” + A(Ric), 
(10.20) m,k=1 


1 
O;Ric;° = 5 0jRico® +B,(Ric), 1<j<3, 


where A(Ric) and 8; (Ric) are linear in Ric. The system (10.20) is readily seen to 
be a linear, symmetrizable hyperbolic system; compare with the treatment of (3.3) 
in Chap. 16. Thus the quantities Ric;° vanish identically provided they vanish at 
t = 0. But the hypothesis that the constraint equations hold at t = 0 is equivalent 
to G;° =Oatt=0,0 <j < 3, by (9.3)-(9.6), and together with (10.10) this 
implies S;, = 0 at t = 0, and hence that 


(10.21) Rie,” =0° at t=0,0<7 <3. 


This finishes the proof of Proposition 10.2. 


Note that if we regard the lapse function \ as an unknown, as well as gj, and 
Kjx, which are 3 x 3 symmetric matrices, then (10.3)—(10.4) is a system of 12 
equations in 13 unknowns. This underdetermined property is a consequence of 
one’s ability to perform an arbitrary change of t-variable (t’ = t’(t)) without 
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affecting the foliation of M. We might insist that \ = 1, thus producing a deter- 
mined 12 x 12 system, but that would lead to breakdown in finite time, since then 
(10.9) and the constraint equation (9.7) would imply 


OH 
Soa he” SoH 


This breakdown might well be due to a bad choice of coordinates rather than to 
an actual breakdown for Einstein’s equations. 

Instead of trying to specify X(t, x) a priori, one might impose an extra equa- 
tion involving . In [CBR] the following approach is taken. Let (ejn(x)) be an 
arbitrary Riemannian metric on S, and set 


(10.22) Nt, x) = e(t, c)—/? g(t, x) 1/2, 
where e = det(e;,), and g = det(g;;,). If we set 
(10.23) kee Akg, b= bh eS Sak, 


a computation yields, for a metric of the form (10.1), the identity 


ig + 3k Ric? 76 — 2B 4'5 ktm — 2k Ric? 45 — 2ayViyk 
(10.24) — ARV a; — kegV ya! — 08V uhag — 3kdlgay — 00-2 halepene™ 
= ke Ric™ jf 42k Ric™ ;; _ Vi{ rane} _ O,Ric™ j;, 


when X satisfies (10.22), where 
(10.25) a; = A" OA, 


and V ; denotes the Levi—Civita connection on S;, associated with the metric ten- 
sor (gi;(t)), and Ok;, is given by 


(10.26) kj =X 2 OP ki = GOVE aids 
and we use the notation 
(10.27) fj) = fig + Tye 


Now, when the right side of (10.24) is required to vanish, we can couple the 
resulting equation to (10.3), obtaining the system 


(10.28) O:9i3 = — 2ki;, 
kaj = Ske Ric® yf + pat ta aa + 2k Ric? ;; 
(10.29) + 2a;Vjyk + 4kV ia; + keV ja" 
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+ a’Veki; + 3kazaj + 4d *kiekjmk™. 


As before, is given by (10.22). This system has a hyperbolic character (compare 
with the discussion of the system (8.37)). The initial-value problem is well posed, 
and one has finite propagation speed. Furthermore, as shown in [CBR], if such a 
system is satisfied, then if we use the Lorentz metric (10.1), the Einstein tensor 
Gin = G™ for this metric satisfies a homogeneous linear hyperbolic system of 
the following form (here, 1 < j,k, @ < 3): 


a,G7* + 2(VIG" + VFG? — g/*V.G") =0, 
(10.30) G+ f7(G"”, VG") =0, 
a:G° + VjG? =0. 


Here, fh is linear in its arguments. In the middle equation, 0 < p,v < 3, while 
1 < j,k,@ < 3. Note that the last equation in (10.30) is just part of (1.55), a 
consequence of the Bianchi identity. 

Suppose now that the system (10.28)—(10.29) is satisfied and also that, att = 0, 
the constraint equations (9.7)—-(9.8) and the (10.4) hold. The constraint equations 
give directly that G°? = 0 at t = 0, for 0 < 7 < 3. In view of (10.7), the equation 
(10.4) implies Ric}, = 0 at t = 0, for 1 < j,k < 3. Now 


—2p:.M ik pi.M 
Su = —A “Ricog + y g’” Rici;, 
1<9,KS3 


and Ricgs = Goo — (1/2)\7Sv, so we deduce that Sy = 0 at t = 0, and hence 
(10.31) GF—0, att=0, 0<j,k<3. 


Now the identity (1.55) (a consequence of Bianchi’s identity) implies 


3 
(10.32) AG? +S°Vv.G* =0 on M, 0<j <3. 
k=1 


In concert with (10.31), this implies 
(10.33) A.G2°=0, at t=0, 


for 0 < 7 < 3. Thus all the Cauchy data for (10.30) vanish at ¢ = 0. Thus, under 
our current hypotheses, we have a solution to Einstein’s equations G/* = 0 on M. 

Another approach makes use of “maximal slicing,” namely, requiring H = 0. 
In view of (10.9), this requires AX = SA. Now, using (9.7), we see that when 
H =0, Ss = Kj, KI" = |K|?, so we hence have the lapse equation 


(10.34) Rie KPO: 
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This has no nontrivial solution if S is compact, but it does if S is unbounded and 
“asymptotically flat’ The use of the evolution equations with maximal slicing 
plays an important role in [CK]. 

The use of the evolution equations, with various approaches to the lapse func- 
tion, and also some variants, involving a “shift vector,’ has played an important 
role in numerical work. A number of papers on this can be found in [EFH]. We 
also mention the recent work [CBY2], which has implications for both the theo- 
retical and the numerical study of Einstein’s equations. 


Exercises 


1. Derive the curvature identity (10.5). 
2. Show that if S is a three-dimensional Riemannian manifold, with metric tensor g;x, 
then 


: : ‘ : 1 
Re i = gikRicy + g;eRics, — gjxRici — gitRic;, - 379s (gingie — 95k Gie)- 
3. Rewrite the proof of Proposition 10.2 in a coordinate-invariant manner. Show that Rico® 
defines a t-dependent family of 0-forms r on S}, that Ric;°, 1 < j < 3, defines a 
family of 1-forms p on S;, and that (10.20) can be written in the form 


Or = 2076p + A(r, p), 


1 
Op = ral + Br, p); 


where 6 : A'(S;) + A°(S;) is determined by the Riemannian metric on each slice S¢. 
Here, each S; is identified with a single slice S., via the vector field rT. 
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